Gene Lcho_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0643 
Symbol 
ID6161907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp694396 
End bp695592 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content52% 
IMG OID641663393 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001789683 
Protein GI171057334 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA TTACACAACA GACAGGCGCC GATACAAAAA CTGGAAATGT CATATCTGAT 
ATGAATGCCA GCGAGGAGCT TTCCCTCATC GATGTCCTGT TAATCCTCTC ACGACGCAAA
CTGCTCCTCG TCGTCGCCCC TCTGCTAGTT GGTTGCGCGG CACTCGGAAT GAGTTATCTC
GTCCGACCGA CCTACACCGC GAGCGTCCAG TTACTACCTC CGCAGCAACA ACAAAGTGGC
GCACTCACGG CACTACTAGG CGCTGCGGGC GGTATGGCAA GTGCGCTTGG CGGCATCTCC
GGTCTTAAAA ATCCGTCAGA TCAGTGGATC GGCATGCTGA AGAGTCGAGC CATCGTCGAT
GCAATTGTTA ACGAATTCAG TTTGCGTGAG ATCTACGAAG TCGAATACCA ATTCAAAGCC
CGTGAAAGGC TCGAGAAAAA CAGCCGGATT GTGGCCGGCA AAGACGGACT GATCGACATC
GAGGTTGACG ATCACGATCC TGAGCGTGCT GCGAAGATAG CGACCGCCTA TGTCGACGAA
CTCCAAAATT TAATGCGCAC CCTTGCAGTG ACGGAAGCGG CACAGCGACG CTTGTTTTTT
GAGAAACAGC TTTCCGACAC GAAGACCAAG CTCATTAAAG CCGAGATACT TTTAACGGAA
GGCGGAATCA ATACAGGGGT ATTGAAGACC AATCCCGAAG CAGCGGTCAG TCAACTCGCG
CAGGCGCAGG CAGCAGTAAC CGCCCAGGAG GTTAAGGTAT CAGTGATGCG AGAATCGATG
ACCAACAGCA ACCCTCAGTT GCGAAATGCG ATACTGGAAC TAGCATCACT ACGTGAACAA
CTTCATAGAT CCAATCGTGA CGAGCCAGAA CGCGCCAAAG GAAGCGGCGC AGAGTACGTG
ACACGGTTTC GTGATTTCAA ATATTACGAA ACCCTATTCG ACTTGTTTGC TCGTCAATAC
GAGATGGCTC GCGCAGATGA GGCTCGTGAC GGATCGGTAA TCCAGGTAAT TGATCCCGCG
CAGGTACCCG AGTACAAATC CGGCCCCAAG CGCGGCATGA TCGCAGTGCT TGCCACGATT
CTGACATTCA TACTGTCCGT GCTGTATGTA CTGGCAAGCC ACGCCCTTAG AGGCTATACG
ACGCGAGCAG ATGGCCGTCT CAAAATGGAT GCACTGAAGC AGGCCATTAT CCGCTGA
 
Protein sequence
MSQITQQTGA DTKTGNVISD MNASEELSLI DVLLILSRRK LLLVVAPLLV GCAALGMSYL 
VRPTYTASVQ LLPPQQQQSG ALTALLGAAG GMASALGGIS GLKNPSDQWI GMLKSRAIVD
AIVNEFSLRE IYEVEYQFKA RERLEKNSRI VAGKDGLIDI EVDDHDPERA AKIATAYVDE
LQNLMRTLAV TEAAQRRLFF EKQLSDTKTK LIKAEILLTE GGINTGVLKT NPEAAVSQLA
QAQAAVTAQE VKVSVMRESM TNSNPQLRNA ILELASLREQ LHRSNRDEPE RAKGSGAEYV
TRFRDFKYYE TLFDLFARQY EMARADEARD GSVIQVIDPA QVPEYKSGPK RGMIAVLATI
LTFILSVLYV LASHALRGYT TRADGRLKMD ALKQAIIR