Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4631 |
Symbol | |
ID | 5902093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5009634 |
End bp | 5011343 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641565150 |
Product | hypothetical protein |
Protein accession | YP_001686249 |
Protein GI | 167648586 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.457948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC AAGTCCATTA CGAGCTGTTC GTGCGCCGCA AGCCCGGCGC GCAATGGACG CTCGACATGG CGACGGAGGT CCGCACCCAC GCCCTGCAGA CCGCCGAGGA GGCGCTGGAG CAGGGGCGAG CGATCGCCGT GCGGGTCAGC AAGGAAACCC TCGACAACGA AACCCGCGAA TACAAGTCGA TCTCGATCTT CACCAAGGGC CAGGTCGACG GCGGCAAGGC CAAGAAGGTG CAGGAAGACC TGGACCCGCT GTGCGTCCAG CCGTCCGACC TCTACACCGC CCACGCCCGC GACCGCATCG GCCGGCTGCT GGAAGGCTGG CTGGCCCGGC ACAACGCCAC CCCGTTCGAG CTGCTGCACC GTCCCGACCT GGTCGAGAAG CTGGAAGCCT CCGGCACCGA CCTGCAGCAC GCCCTGCAGA AGATCGCCAT CCCCGAGGCC GAAGCCCGGG GCATGTCGGT CCACGAACTG ATCCGCACGT TCCAGGGCCT GGTCGAGCGC ACCGTCGCCA ACCTGATGAA GGCCTTCAAG AAGGGCGCCC TGCCCGATCT CGACACGGAA GGCTTCGCCC GAGCCGCCGA GCGCCTGTCC ACCGATCCCG ACCGCGCCTT CCTGCTGGGG GCCGGCGTCG CCGCCTCGAT CGCGCCCGGC AAGAACTGGT CCGAGAAGAT CGCCCGCCTG GTCGATCTGG CCGACGCCGC CCCGACCGAA CCCAAGGCCC GGGCCGCCGC CCTGGCCGCC ATCGAGACGC CCCTGGCCGA GATCATCGGC TCCAAGGCCG GCATGGCCGA CCTGCTGGGC GCAGGCGACG CCGACCTCGG CACCACCCTG GCGGCCATGA CCCGCCTCGC CGGCGGGGCC CAGGTCGAGG GCCTGATCCG CGTGGAGGCC GGCGTGCGGC ACTGCATGCC CGAGCTGTCC GGCACGGCCA AGCGGCTGAG CGAGTGGCTG AGCGGCGAGG ACTTCCCCGC CGTCCGCGCC TCGATCGCCC ACCGGGTGCT CAAGGAACTG AACGGGGTGC GCCGCCTCAA GCCCTCCGAC GCCGAGGCCG AGATCGAACA CCTGCGCGCC CTGGCCATGA GCCTGACGGC CGCCGCCGGC CGCATTCTTC CCGCGGAAGA CATCACCAGC GCCTTCACCA CCCGCTCCAA GACCCTGCTG AACGGCGAGT TCATCGAAGC CCTGCTCGGT CGCGACCGCT CGTCGCGCGA GGAGATCCAG ATGCTGATCC GCCTGGCCGA GAACGTCATG GGCGCGGTCA ACAAGCGCAT GGCCGCCCGC TGGCTGTCGG CCAACGTCCT GGCCCTGCGC TTCGAGCGCG AACTGCGCCA GGGTCCCGAA TCGCCGGCCG CCAAGCTGGC CGCGCTGGCC ACCCTGCAAA AGTCCCTGGT CCGCTCGGGC CTGGTGGTCG AGGACTACCA GCCCCTGTGC GCCCGGCTGG GCGAGGTGGG CGGCATGATC GAGGCCGACG CCCGCCTGAT CGCCATGCTG GTCCGCGCCC CCGCGCCCCT GCCCCAGAAA CTGTCCCTGC TGATCAAGCT GGCCATGGGC GACGCCGGCC CGACCGGACC GGTCGCCGAC AAGGCCAAGC TCGAGGCGCT GAAACTGGCC CGCGCGCCCG AAGCCCGCGA ACAGCTGGCG GGCTCGCCGG AGACCATGGA CCTGCTCAAG GGCATGGTTC AGCAGAAGGC GGCGGCTTAG
|
Protein sequence | MSDQVHYELF VRRKPGAQWT LDMATEVRTH ALQTAEEALE QGRAIAVRVS KETLDNETRE YKSISIFTKG QVDGGKAKKV QEDLDPLCVQ PSDLYTAHAR DRIGRLLEGW LARHNATPFE LLHRPDLVEK LEASGTDLQH ALQKIAIPEA EARGMSVHEL IRTFQGLVER TVANLMKAFK KGALPDLDTE GFARAAERLS TDPDRAFLLG AGVAASIAPG KNWSEKIARL VDLADAAPTE PKARAAALAA IETPLAEIIG SKAGMADLLG AGDADLGTTL AAMTRLAGGA QVEGLIRVEA GVRHCMPELS GTAKRLSEWL SGEDFPAVRA SIAHRVLKEL NGVRRLKPSD AEAEIEHLRA LAMSLTAAAG RILPAEDITS AFTTRSKTLL NGEFIEALLG RDRSSREEIQ MLIRLAENVM GAVNKRMAAR WLSANVLALR FERELRQGPE SPAAKLAALA TLQKSLVRSG LVVEDYQPLC ARLGEVGGMI EADARLIAML VRAPAPLPQK LSLLIKLAMG DAGPTGPVAD KAKLEALKLA RAPEAREQLA GSPETMDLLK GMVQQKAAA
|
| |