Gene Caul_4705 details


Gene Information

Locus tag: Caul_4705
Symbol:
ID: 5902167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5091381
End bp: 5092640
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 66%
IMG OID: 641565224
Product: sodium:dicarboxylate symporter
Protein accession: YP_001686323
Protein GI: 167648660
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
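
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5091381
END_BP = 5092640


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


print(gene_length(START_BP, END_BP))                  # 1260
print(protein_length(gene_length(START_BP, END_BP)))  # 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields in the record.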


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.600278
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAC GTTTCGCCTA CCTGATCATC GCGTCCATGA TCCTCGGGGT CCTGGTCGGC 
TGGGCCTGCA ACCAGTACCT CGACGCCGCC CAGACCGCCG AAGCGGTCAA GTGGTTCAAG
ATGGGGACGG ACCTGTTCCT GCGGCTGATC AAGATGATCA TCGCCCCCCT GGTCCTCACC
ACCCTGGTGG CCGGCATCGC CCACATGGAG GACGCCGCCG CCGTCGGCCG GATCGGCGCC
AAGACCATGG GCTGGTTTAT CAGCGCCTCG GCCGTCTCGC TGCTGCTGGG TCTGCTGATG
GTGCATCTGC TGCATCCCGG CGCGGGCCTG GTGCTGAACG AGGCGACCAA CGTGGCCGCC
AACGCCCCGG CCGCCTCGAC CGAGACCTTC ACCCTGCAGG GCTTCATCAC CCACCTGGTG
CCGGCCTCGA TCTTCGAGGC CATGGCCAAG AACGAGATCT TGCAGATCGT GGTCTTCAGC
CTGTTCGTCG GCACCGCCGT GGCCTCGCTG GACAACAAGG CCCCGCACAT CCTGGAGCTG
GCCGAGCAGG GCGCCCAGGT CATGCTCAAG GTCACCGGGT TCGTGATGAA GCTGGCCCCG
CTGGCGATCT TCTGCGCCCT GGCCTCGACC ATCGCCGCCC AGGGCATCTC GATGCTGGTC
GTCTATGGCA AGTTCGTGCT GGGCTTCTAC GCCACCATGG GCACGCTCTG GCTGCTGCTG
TTCATCGCCG CCTTCCTCGT GCTGGGCAAG CGGGCGATCC CGCTATTTGG CGCGATCCGC
GAGCCGGCCC TGCTGGCCTT CTCGACCGCC AGCTCGGAAG CCGCCTATCC GCGTATCCTC
GACGTCCTGC CGAAGCTGGG CATTCGTCGT CGCATCGTCT CGTTCGTCCT GCCGCTCGGC
TATTCGTTCA ATCTCGACGG CTCGATGCTC TACTGCACCT TCGCGACGGT CTTCATCCTC
CAGGCCCACG GCGTGCACCT GACGATCCAG CAGCAGATCT TCATGCTGCT GCTGCTGATG
GTCACCTCGA AGGGCATCGC CGGCGTGCCG CGCGCCTCGC TGGTCGTCAT CATGGCCACC
CTGACCTATT TCGGCCTGCC CGAGGCCTGG ATCGCCTTGG TGCTCGGCGT CGATCACCTG
CTCGACATGG GCCGCAGCGC CACCAACGTG GTCGGCAATT CGGTCGCCGC CGCCGTGGTC
GCCAAGTGGG AGGGCGAGCT GGACGATCCA GAGCCCGAGG CGGCCTCCGC GAAGGCCTAG
 
Protein sequence
MNKRFAYLII ASMILGVLVG WACNQYLDAA QTAEAVKWFK MGTDLFLRLI KMIIAPLVLT 
TLVAGIAHME DAAAVGRIGA KTMGWFISAS AVSLLLGLLM VHLLHPGAGL VLNEATNVAA
NAPAASTETF TLQGFITHLV PASIFEAMAK NEILQIVVFS LFVGTAVASL DNKAPHILEL
AEQGAQVMLK VTGFVMKLAP LAIFCALAST IAAQGISMLV VYGKFVLGFY ATMGTLWLLL
FIAAFLVLGK RAIPLFGAIR EPALLAFSTA SSEAAYPRIL DVLPKLGIRR RIVSFVLPLG
YSFNLDGSML YCTFATVFIL QAHGVHLTIQ QQIFMLLLLM VTSKGIAGVP RASLVVIMAT
LTYFGLPEAW IALVLGVDHL LDMGRSATNV VGNSVAAAVV AKWEGELDDP EPEAASAKA
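
The relationship between the two sequence blocks can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own pipeline: it uses the codon assignments of translation table 11, which match the standard genetic code (table 11 differs only in its permitted start codons, which are not modeled here), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Whitespace is stripped first so the 10-base blocks can be pasted in as shown.

```python
# Codon table built in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAACAAAC GTTTCGCCTA CCTGATCATC GCGTCCATGA TCCTCGGGGT CCTGGTCGGC"
print(translate(first_line))  # MNKRFAYLIIASMILGVLVG
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.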