Gene Caul_4108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4108 
Symbol 
ID5902567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4464748 
End bp4466052 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content68% 
IMG OID641564629 
Productsodium:dicarboxylate symporter 
Protein accessionYP_001685730 
Protein GI167648067 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0814495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.311636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTGC TTCGGCCGTT GAAGACGCTG TATGTCCAGG TGCTGATCGG CATCGCCCTG 
GGGGTGCTAG TCGGGGCGCT GTGGCCGGAG GTCGGGGTGG CGCTGAAGCC GCTGGGCGAT
GCGTTCATCA AGCTGGTCAA GCTGGTGATC GCCCCGGTGA TCTTCCTGAC CGTGGCCAGC
GGCATCGCCC ACATGGGCGA CATCAAGGCG TTCGGCCGGG TGGGGGTCAA GGCCCTGCTC
TATTTCGAGG TGGTCTCGAC CCTGGCCCTG GCGGTCGGGC TGGTCGTCGG CCACATCCTG
CAGCCGGGCC ACGGCTTCAA CATCGACCCG GCCACGCTGG ATCCGAAGAT CGCCGCCGGC
TACCTGGAGA AGGCCCATCA CGGCGAGGGG CTGGTTCCCT ATCTGCTGCA CCTGATCCCC
GACACCTTCT TCGGAGCCTT CGCCGAGGGC AACCTGCTGC AGGTGCTGGT GATCTCGGTG
CTGACGGGTT TCGCCTGCAC GCGCATGGGT CCGTTCGGCG ACCGTGCGGC GGCGGCGATG
AGCGACATGG CCAAGCTGTT CTTCGGCATC ATCCACGTGG TGGTCAAGTT GGCGCCGCTG
GGGGCGTTCG GGGCCATGGG CTTCACGATC GGCAAGTACG GGCTCGCCAG CCTGGTGCAG
TTAGGGGCGC TGGTGGCCAC CTTCTACATC ACCGCCCTCC TGTTCGTGCT GGTGGTGCTG
GGGGCGATCG CCTGGGCCTG CGGCTTCTCG ATCCTCAAGT TCCTGGCCTA TATCCGCGAG
GAGTTGCTGA TCGTGCTGGG CACCAGTTCG TCGGAGAGCG CCCTGCCCCA GCTGATCGAG
AAGCTGGAGC GGCTGGGCGC CCGCAAGTCG GTGGTCGGGC TGGTGGTGCC CACCGGCTAC
AGCTTCAACC TCGACGGCAC CAACATCTAC ATGACCCTGG CCACCCTGTT CCTGGCCCAG
GCCACCAACA CCCACCTGAG CTGGATCCAG ATGGCGAGCC TGCTGGGCGT CGCCATGCTG
ACCTCCAAGG GGGCCAGCGG GGTGACCGGG GCCGGCTTCA TCACCCTGGC CGCCACCCTG
GCCGTGGTCC CCGACATCCC GATCGCCGCC CTGGCGGTGC TGGTCGGGGT CGACCGGTTC
ATGAGCGAGT GCCGGGCCCT GACCAATTTC GTCGGCAACG GCGTGGCCAC CCTGGTGGTG
GCCCGCTGGG AGGGCGCCCT GGATCGCGAC CGGTTGGCGC GGGAACTGAC CCGAGGTCCC
AACGTCCCGC CCGTCGAGGT CGTCGAGGAA CTTCCCGCCG CCTGA
 
Protein sequence
MGLLRPLKTL YVQVLIGIAL GVLVGALWPE VGVALKPLGD AFIKLVKLVI APVIFLTVAS 
GIAHMGDIKA FGRVGVKALL YFEVVSTLAL AVGLVVGHIL QPGHGFNIDP ATLDPKIAAG
YLEKAHHGEG LVPYLLHLIP DTFFGAFAEG NLLQVLVISV LTGFACTRMG PFGDRAAAAM
SDMAKLFFGI IHVVVKLAPL GAFGAMGFTI GKYGLASLVQ LGALVATFYI TALLFVLVVL
GAIAWACGFS ILKFLAYIRE ELLIVLGTSS SESALPQLIE KLERLGARKS VVGLVVPTGY
SFNLDGTNIY MTLATLFLAQ ATNTHLSWIQ MASLLGVAML TSKGASGVTG AGFITLAATL
AVVPDIPIAA LAVLVGVDRF MSECRALTNF VGNGVATLVV ARWEGALDRD RLARELTRGP
NVPPVEVVEE LPAA