Gene Lcho_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0967 
Symbol 
ID6161476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1033191 
End bp1034345 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content70% 
IMG OID641663718 
Producttetratricopeptide repeat protein 
Protein accessionYP_001790004 
Protein GI171057655 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTG ACCTGCAATG GCTGCTGCTG GGCCTGCCGG TGGCGTTCGC GCTCGGCTGG 
CTGGGCTCGC GCCTCGACCT GCGCCACCTC AGGCGCGAAA CCGAATCCTC GCCGCGGGCC
TATTTCAAGG GCCTGAACCT GCTGCTCAAT GAACAGCAGG ACAAGGCCAT CGATGCCTTC
ATCGAGGCGG TGCAGCAAGA CCCGGGCAGC ACCGACCTGC ACTTCGCGCT CGGCAACCTG
TTCCGTCGCC GCGGTGAATA CGAACGCGCC GTGCGGGTCC ACCAGCACCT GCTGGCACGC
GCCGATCTGC CCACCAGCGA GCGCGACCGC GCCCAGCATG CCCTCGCCCA GGATTACCTG
AAAGCCGGCC TGTTCGACCG TGCCGAGGCG GCCTACAAGG CGCTCGAAGG GACGGCCTTC
GCCACCGATG CGCGGCTGGC GTTGCTGACC TTGCACGAGT CCGCGCGGGA CTGGAAATCG
GCCATCGAAG TGGCCCGCGG GCTCGAGGCC ACCGCTGCCG GCAGCTTTGC CCAGCGCATC
GCCCACTACT GGTGCGAGCT GTGTCTGGAG GCCGATGCGG CGGGTGACGG CGCCGCCGCC
GACGCCGCGC TGACCAAGGC GCGCGAAGTG GCCCCGCAGT CGGCACGGCC GCTGATCCTG
TCGGGCCAGC GCCTGGCGCG TGCGGGCCGG CACACCGAGG CCCTGGGCCT GTGGACCGCG
CTGTCGACCG TGCACCCGGA AGCCTTTTCA GTCATCGCCG GTGACTATGC CGCCAGCGCC
CAGGTCTGCC AGCGTGCCGA CGAGGCGCTG GTCCGGCTCA AGGCCCTGCA TCTGGCGGCG
CCCTCTGCCG ACCTGCTGCT GGCTGCGCTG AGCCTGGAGT CCGACGCCGC GGCGCGGCGC
CGGATGCTGG TGCAGCACCT CAAGGAAAAT CAGAGCCTGA GCGCCGCACT CAAGCTGTTG
CAGGACCCGG CCGCCGCGCC GGACGACGAT GGCGGCGAAA GCCTGGCCAT GCAGCAGGCC
GTCGGCAAGG CCTTGCGCCC CTTGCGCCGC TACCACTGCG CGGCTTGCGG TTTCGAGGCA
CAGAACTACT TCTGGCAATG CCCCGGCTGC CACGGCTGGG ACACCTATCC GCCGCGTCGA
CTCGAGGACA TGTAG
 
Protein sequence
MDFDLQWLLL GLPVAFALGW LGSRLDLRHL RRETESSPRA YFKGLNLLLN EQQDKAIDAF 
IEAVQQDPGS TDLHFALGNL FRRRGEYERA VRVHQHLLAR ADLPTSERDR AQHALAQDYL
KAGLFDRAEA AYKALEGTAF ATDARLALLT LHESARDWKS AIEVARGLEA TAAGSFAQRI
AHYWCELCLE ADAAGDGAAA DAALTKAREV APQSARPLIL SGQRLARAGR HTEALGLWTA
LSTVHPEAFS VIAGDYAASA QVCQRADEAL VRLKALHLAA PSADLLLAAL SLESDAAARR
RMLVQHLKEN QSLSAALKLL QDPAAAPDDD GGESLAMQQA VGKALRPLRR YHCAACGFEA
QNYFWQCPGC HGWDTYPPRR LEDM