Gene Lcho_3640 details

Gene Information

Locus tag: Lcho_3640
Symbol:
ID: 6163175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4065113
End bp: 4066438
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 67%
IMG OID: 641666413
Product: hypothetical protein
Protein accession: YP_001792659
Protein GI: 171060310
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
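
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; both assumptions are consistent with the figures in this record but are not stated by it. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 4065113
END_BP = 4066438


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                  # 1326
print(protein_length(gene_len))  # 441
```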


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATCG CCATACTTTT CGCCCTCATC CTGCTCAACG GCCTGTTCGC GATGTCGGAG 
ATCGCGCTGG TCACGGCCCG CAAGGTCCGG CTGCAGAAGC TCATCGATGA GGGCGACCGA
TCCGCCGAGG CGGCGGTCAA GCTGGGGGAA GATCCGACCC GTTTCCTCTC CACGATCCAG
ATCGGCATCA CCTCGATCGG CGTGCTCAAC GGCATCGTCG GCGAGGCCGC GCTGGCCAAG
CCGCTGGCCT TGTGGCTGGA GTCGCTGGGT CTGTCGCAGC TGTATTCGAC CTACGCCGCC
ACCGGCCTGG TGGTGGTGCT GATCACCTAC TTCTCGATCG TGGTCGGCGA GCTGGTGCCC
AAGCGGGTCG GCCAGACCCA CCCCGAGACG CTGGCCCGGC TGGTGGCGCG CCCGATCAAC
TGGCTGGCGA TCGGCACCAA GCCTTTCGTG CGGCTGCTGT CGGTGTCGAC CCACGCGCTG
CTGCGCCTGC TGGGTGTCAA GGACAACGGC GGCAGCGCCG TCACCGAAGA AGAGATCCAC
GCCATGCTCG CCGAGGGCAC CAACGCCGGC GTGATCGAGT CGCACGAACA CGCGATGGTG
CGCAACGTCT TCCGCCTCGA CGACCGCCAG ATCGGCTCGC TGATGGTGCC GCGCGGCGAC
GTCACCTTCC TCGACGTCGA CCTGCCGTTC GAGCAGAACC TGGCGCGCAT CGAGCAGGCC
GATCACGCAC GGTTCCCGGT GGTCAAGGGC GGCAGCCTCG ACAACGTGCT GGGCGTCGTC
AACGCCCGCC AGTGGCTGTC GCGCTCGTTG CGGCTCGACG ACCGCAACCT CGCCGAGCAG
CCGCTGCAGC ACCCGCTGTA CGTGCCCGAG ACGCTCACCG GCATGGAACT GCTCGACAAC
TTCCGCCTGT CGGACGTGCA CATCGCCTTC GTGATCGACG AATACGGCGA GGTGCAGGGC
ATCGTCACGC TGCAGGACCT GATCGAGGCG ATCACCGGCG AGTTCCGCCC GCGCGATCCG
GAAACCTCGT GGGCGGTGCA GCGCGACGAC GGTTCGTGGC TGCTCGACGG CCACATCCCG
GTGCCGGAGC TGAAGGACCG GCTCGGCCTC GATTCGGTGC CCGAAGAAGA CCGCGGCCGC
TATCACACGC TCAGCGGCAT GCTGATGCTG CTGACCGGGC GCCTGCCCAA GGTGGCCGAC
ACCGCCAGCT GGGAAGGCTG GCGGCTGGAG ATCGTCGACA TGGACGGCAA GACCATCGAC
AAGGTGCTGG CGAGCCGCAT CCCGGAAGAG GCCGCGACGG ACGGCTCCGA GGTGTCTACC
GGCTGA
 
Protein sequence
MEIAILFALI LLNGLFAMSE IALVTARKVR LQKLIDEGDR SAEAAVKLGE DPTRFLSTIQ 
IGITSIGVLN GIVGEAALAK PLALWLESLG LSQLYSTYAA TGLVVVLITY FSIVVGELVP
KRVGQTHPET LARLVARPIN WLAIGTKPFV RLLSVSTHAL LRLLGVKDNG GSAVTEEEIH
AMLAEGTNAG VIESHEHAMV RNVFRLDDRQ IGSLMVPRGD VTFLDVDLPF EQNLARIEQA
DHARFPVVKG GSLDNVLGVV NARQWLSRSL RLDDRNLAEQ PLQHPLYVPE TLTGMELLDN
FRLSDVHIAF VIDEYGEVQG IVTLQDLIEA ITGEFRPRDP ETSWAVQRDD GSWLLDGHIP
VPELKDRLGL DSVPEEDRGR YHTLSGMLML LTGRLPKVAD TASWEGWRLE IVDMDGKTID
KVLASRIPEE AATDGSEVST G
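
The sequence blocks above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, plus a simple GC fraction calculation that could be compared against the GC content listed in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool; only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGGAAATCG CCATACTTTT CGCCCTCATC CTGCTCAACG GCCTGTTCGC GATGTCGGAG"

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC share of this first line only
```

Pasting the full gene sequence block into `clean_sequence` should give a string whose length matches the listed gene length, which is a quick way to confirm nothing was lost when copying.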