Gene Lcho_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1114 
Symbol 
ID6162296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1189881 
End bp1190867 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID641663868 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001790148 
Protein GI171057799 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.347843 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTCC GTCGCACCCT GCTGGCCGCC TGCGCCGCTG CCGCCACCCT CGGTTTCGCC 
CCGCTGACCA CCTCGGCCCA GAACATCGTC ATGAAGGCCG CCGACGTGCA CCCGGCCGGC
TACCCGACCG TGGTCGCGGT CGAGAACATG GGCAAGAAGC TCGACGCCGC CACCAGCGGT
CGCATCAAGT TCCAGATGTT CCCCGGCAGC GTGCTCGGCG GCGAGAAGGA GATGATCGAG
CAGACCCAGT TCGGCGCGAT CCAGCTGCTG CGCACCTCGC TGGGCCCGAT CGGCCCGGTG
GTGCCCGAGG TGAACGTGTT CAACATGCCC TTCGTGTTCC GCAACATCGC CCACATGCGC
GCGGTGATCG ACGGCCCGAT CGGCCAGGAA CTGCTCGACA AGGTCAGCGC CTCGCCGGCC
CGGATGGTCG CGCTGGCCTG GATGGACGGC GGCTCGCGCA GCCTCTACAC CAAGAAGCCG
GTGCGCAGCC CCGCCGACCT GAAGGGCCAG AAGATCCGCA TGATGGGCAA CCCGCTGTTC
GTCGACACCA TGAACGCGAT GGGCGGCAAC GGCATCGCGA TGGGCTACGG CGAGGTCTTC
ACCGCCATCC AGACCGGCGT GATCGACGGC GCCGAGAACA ACCCGCCCAG CCTCTACACC
GCCAACCACT TCAAGGCCGG CGCCAAGTAC TTCACCCAGA CCAACCACCT GATCATTCCC
GAGATCCTGG TGATGTCGAA GGTGACATTC GACAAGCTCA GTCCGGCCGA CCAGGCGCTG
GTCAAGAAGA CCGCCCGCGA AGCGCAGCTC GAGCAGCGCA CGCTGTGGGA CAAGGCGGTG
GCCGACTACA CCACCAAGCT CAAGGCCGAA GGCGTCGAGT TCATCGAGAT GGACAGCAAG
CCCTTCTTCG ACGCCACCGC ACCGGTGCGC GCCAAGTACG GCGCCAACTT CGCCGACCTG
ATGAAGCGCA TCGAGGCCGT CAAGTAA
 
Protein sequence
MNFRRTLLAA CAAAATLGFA PLTTSAQNIV MKAADVHPAG YPTVVAVENM GKKLDAATSG 
RIKFQMFPGS VLGGEKEMIE QTQFGAIQLL RTSLGPIGPV VPEVNVFNMP FVFRNIAHMR
AVIDGPIGQE LLDKVSASPA RMVALAWMDG GSRSLYTKKP VRSPADLKGQ KIRMMGNPLF
VDTMNAMGGN GIAMGYGEVF TAIQTGVIDG AENNPPSLYT ANHFKAGAKY FTQTNHLIIP
EILVMSKVTF DKLSPADQAL VKKTAREAQL EQRTLWDKAV ADYTTKLKAE GVEFIEMDSK
PFFDATAPVR AKYGANFADL MKRIEAVK