Gene Lcho_4073 details


Gene Information

Locus tag: Lcho_4073
Symbol:
ID: 6162025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4566531
End bp: 4567652
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 71%
IMG OID: 641666851
Product: hypothetical protein
Protein accession: YP_001793090
Protein GI: 171060741
COG category: [S] Function unknown
COG ID: [COG5351] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
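
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the helper names `span_length` and `protein_length` are illustrative and not part of the database.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(4566531, 4567652)
print(gene_len)                  # 1122, matching "Gene Length"
print(protein_length(gene_len))  # 373, matching "Protein Length"
```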


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000125532
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCCATC CCGAACTCGT CAACCACACC GGCTTCGCCT TCGAAGCCCA GCTGCTGACC 
GACGAAGAAG GGGTGCCGCA GTTCGTCACC TGCGTGCAGG CCGTCTACAC GCTCGGCCCG
GGCGGCGCCT TGCAGCTGAT CGAGCCACAA CCGCCGGTGT TACTCGGTGG CAAGTGGCGG
GGTGACCCGG CCACCACCAG CCTGGTCAGC GAGCCGCAGA TCGCCTTCAT CAAGCCGGCC
ACCGACGTGG TGCTGATCGG TCATGCCCTG CCCACGTCGG CCGACCGCAC CGAGGGCCTC
GTGGGCCTGC GTGTCGGTCC GCTGCAAAAG ACCGTCAAGG TCTTCGGTGA CCGACGTGTC
GTGCGGCGGC TGGGCCTGGC GATGATCGGC AGGCCCGCGC CCTTCGAGCG CCTGCCGCTG
GTGTACGAGC GGGCCTTCGG TGGCTGGGAT CGCAGCGATG CCGACCCGGG CCAACACCGC
CGGGAGGCGC GCAATCCCGT CGGCGTGGGC CTGCGGGCCC ACCTGAAGCC CGAAGAAGAA
GCCTGGCTGC CCAACTTCGA GGATCCGCAG CACCTGATGG CTTCGGTCGA CGACACCCCT
CCGCCGGCCG GTTTCGGTTT CATCGGCCCC GACTGGCAGC CGCGCCTGGG TCTTGCCGGC
ACCTACGACG CCCTGTGGGT CAAGACGCGC CGGCCGCTGC TGCCGCGCGA CTTCGACCGT
CGCTTCTTCA ACGCCGCCTC GCCGGGGCTG GTCGCCCCCG GCTACCTGAG GGGTGACGAA
GTGGCGGTCG TGATCGGCAT GGCCCCCGAA GGCCGGGTCG ACTTCCGCCT GCCCGGCGGG
CCTGCGCCCG CCTGCCGCAT CGGGCTGCGC GGGCGCCGGT GGCAGGCGCT GCAGACCGTG
CTCGACACCG TCACCATCGA CCTCGACGCC CGCCGCGTCA CGCTGATGTG GCGCGCCCAC
CTCGCCGTGC GCAACGGCCC GCACGACGTG CTGGCCATCG AACTGCACCC CGATGCGCAG
GCCGCCGCCT GGCACGCCGC CGAGAAAGCC GCCGCACTCG CGCTGCTGAC ACGGGATGCG
GCCGAAGAAG ACGCCGCTCC CACCGCGAAC GACGAGGCAT GA
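
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used for illustration.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGCCCCATC CCGAACTCGT CAACCACACC GGCTTCGCCT TCGAAGCCCA GCTGCTGACC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all of the sequence rows to `clean_sequence` gives the full gene, whose length and GC fraction can then be compared with the Gene Length and GC content fields.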
 
Protein sequence
MPHPELVNHT GFAFEAQLLT DEEGVPQFVT CVQAVYTLGP GGALQLIEPQ PPVLLGGKWR 
GDPATTSLVS EPQIAFIKPA TDVVLIGHAL PTSADRTEGL VGLRVGPLQK TVKVFGDRRV
VRRLGLAMIG RPAPFERLPL VYERAFGGWD RSDADPGQHR REARNPVGVG LRAHLKPEEE
AWLPNFEDPQ HLMASVDDTP PPAGFGFIGP DWQPRLGLAG TYDALWVKTR RPLLPRDFDR
RFFNAASPGL VAPGYLRGDE VAVVIGMAPE GRVDFRLPGG PAPACRIGLR GRRWQALQTV
LDTVTIDLDA RRVTLMWRAH LAVRNGPHDV LAIELHPDAQ AAAWHAAEKA AALALLTRDA
AEEDAAPTAN DEA
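
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own procedure: it builds the codon assignments that translation table 11 shares with the standard genetic code, and it does not handle the alternative start codons that table 11 allows (not needed here, since this gene starts with ATG). `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order, third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence as printed in the record.
first_row = "ATGCCCCATC CCGAACTCGT CAACCACACC GGCTTCGCCT TCGAAGCCCA GCTGCTGACC"
print(translate_cds(first_row))  # MPHPELVNHTGFAFEAQLLT
```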