Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4705 |
Symbol | |
ID | 5902167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5091381 |
End bp | 5092640 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565224 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001686323 |
Protein GI | 167648660 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.600278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAC GTTTCGCCTA CCTGATCATC GCGTCCATGA TCCTCGGGGT CCTGGTCGGC TGGGCCTGCA ACCAGTACCT CGACGCCGCC CAGACCGCCG AAGCGGTCAA GTGGTTCAAG ATGGGGACGG ACCTGTTCCT GCGGCTGATC AAGATGATCA TCGCCCCCCT GGTCCTCACC ACCCTGGTGG CCGGCATCGC CCACATGGAG GACGCCGCCG CCGTCGGCCG GATCGGCGCC AAGACCATGG GCTGGTTTAT CAGCGCCTCG GCCGTCTCGC TGCTGCTGGG TCTGCTGATG GTGCATCTGC TGCATCCCGG CGCGGGCCTG GTGCTGAACG AGGCGACCAA CGTGGCCGCC AACGCCCCGG CCGCCTCGAC CGAGACCTTC ACCCTGCAGG GCTTCATCAC CCACCTGGTG CCGGCCTCGA TCTTCGAGGC CATGGCCAAG AACGAGATCT TGCAGATCGT GGTCTTCAGC CTGTTCGTCG GCACCGCCGT GGCCTCGCTG GACAACAAGG CCCCGCACAT CCTGGAGCTG GCCGAGCAGG GCGCCCAGGT CATGCTCAAG GTCACCGGGT TCGTGATGAA GCTGGCCCCG CTGGCGATCT TCTGCGCCCT GGCCTCGACC ATCGCCGCCC AGGGCATCTC GATGCTGGTC GTCTATGGCA AGTTCGTGCT GGGCTTCTAC GCCACCATGG GCACGCTCTG GCTGCTGCTG TTCATCGCCG CCTTCCTCGT GCTGGGCAAG CGGGCGATCC CGCTATTTGG CGCGATCCGC GAGCCGGCCC TGCTGGCCTT CTCGACCGCC AGCTCGGAAG CCGCCTATCC GCGTATCCTC GACGTCCTGC CGAAGCTGGG CATTCGTCGT CGCATCGTCT CGTTCGTCCT GCCGCTCGGC TATTCGTTCA ATCTCGACGG CTCGATGCTC TACTGCACCT TCGCGACGGT CTTCATCCTC CAGGCCCACG GCGTGCACCT GACGATCCAG CAGCAGATCT TCATGCTGCT GCTGCTGATG GTCACCTCGA AGGGCATCGC CGGCGTGCCG CGCGCCTCGC TGGTCGTCAT CATGGCCACC CTGACCTATT TCGGCCTGCC CGAGGCCTGG ATCGCCTTGG TGCTCGGCGT CGATCACCTG CTCGACATGG GCCGCAGCGC CACCAACGTG GTCGGCAATT CGGTCGCCGC CGCCGTGGTC GCCAAGTGGG AGGGCGAGCT GGACGATCCA GAGCCCGAGG CGGCCTCCGC GAAGGCCTAG
|
Protein sequence | MNKRFAYLII ASMILGVLVG WACNQYLDAA QTAEAVKWFK MGTDLFLRLI KMIIAPLVLT TLVAGIAHME DAAAVGRIGA KTMGWFISAS AVSLLLGLLM VHLLHPGAGL VLNEATNVAA NAPAASTETF TLQGFITHLV PASIFEAMAK NEILQIVVFS LFVGTAVASL DNKAPHILEL AEQGAQVMLK VTGFVMKLAP LAIFCALAST IAAQGISMLV VYGKFVLGFY ATMGTLWLLL FIAAFLVLGK RAIPLFGAIR EPALLAFSTA SSEAAYPRIL DVLPKLGIRR RIVSFVLPLG YSFNLDGSML YCTFATVFIL QAHGVHLTIQ QQIFMLLLLM VTSKGIAGVP RASLVVIMAT LTYFGLPEAW IALVLGVDHL LDMGRSATNV VGNSVAAAVV AKWEGELDDP EPEAASAKA
|
| |