Gene Lcho_2366 details

Gene Information

Locus tag: Lcho_2366
Symbol:
ID: 6159754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 2570116
End bp: 2571534
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 68%
IMG OID: 641665135
Product: chain length determinant protein EpsF
Protein accession: YP_001791396
Protein GI: 171059047
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03017] chain length determinant protein EpsF
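
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(2570116, 2571534)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields of this record.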


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCT CCCAGTTCCT CTCGATCCTC AAGGCTCGCT GGATCGCCGC CCTGCTGGTG 
CTCGTGCTGA CCGTGGGCAC CACCATCGGC GTGAGCCTGA TGCTGCCCAA GAACTACACC
GCCTCGGCGG CCGTGGTGCT CGACGTGCGC TCGCCCGACC CGATCGCCGG CATGGTGCTC
GGCGCGATGG CGATGCCGGC CTACATGGCC ACCCAGGTCG ACATCATCCA GAGCGACCGC
GTCGCCCAGC GCGTGGTGCA GGGCCTGCGC CTGACCGAAA ACCCCGAGAC CCGCCAGCAG
TGGCAGGACG CCACCGGCGG CAAGGGCAAC TTCGAGGCCT GGCTGGCCGA CCTGCTCAAG
AAGAAGCTCG ACGTCAAGCC CTCGCGCGAG AGCAACGTCA TCAACATCGG CTACACCAGC
CCCGACCCGC GTTTTGCGGC GGCACTGGCC AACGCCTTCG TGCGCTCGTA CATGGACGTC
AGCATCGGCC TGCGGGTGTC GCCGGCCAAG CAGTACAACG AGTTCTTCGA CGCCCGCGGC
AAGGAACTGC GCGAGGCCCT CGAACAGGCC CAGGCCAAGC TCACCACCTA CCAGAAGACC
AGCGGCATCC TGGCCACCGA CGAGCGTTTC GACGTCGAGA ACCAGCGCCT CAACGAACTC
AGCTCGCAGC TCGTGGCCCT GCAGGCGCTG TCGGCCGAAT CGACCAGCCG CAGCGCGCAG
GCCCGCAACC AGGCCGACCA GCTGGGCGAC GTCATCAACA ACCCGGTGGT GGCCGGCCTG
CGTGCCGACC TGTCGCGCCA GGAAGCGCGC CTGATGGAAA TGAACTCCAA GCTCGGCGAC
GCCCACCCGC AGGTGGTCGA GCTGCGCGCC AACATCGCCG AACTGCGTCA GCGCATCGAC
GGCGAAACCC GCCGCGTCAG CGGCAGCGTG GGCATCAACA ACACCATCAA CAAGGCCCGC
GAAGGCGAAG TCCGCGCCGC CCTCGAAGCC CAGCGCGCCA AGGTGCTGGC GCTCAAGCAG
CAGCGCGACG AGGCGCTGGT GCTGATGAAG GAAGTCGAGA CCGCCCAGCG CGCCTACGAC
CAGGTGGTGG CCCGTGCCAG CCAGACCAAC CTCGAGAGCC AGAACACCCA GACCAACATC
TCGGTGCTCA CGCCCGCCAC CGAGCCGGCC GACCATTCGT CGCCCAAGCT GCTGCTCAAC
GCCTTGCTGA GCGTCTTCCT GGGTACCTTG CTGGCGGTCG GCTTTGCGCT GGTGCGCGAA
CTGATGGACC GCCGCGTGCG CACCGTCGAA GACCTGGCCG AAGGCCTCGG CCTGCCGGTG
CTGGGCGCGC TGCCCAAGCC GATGCGCGGA TCGGCCCGCA GCCCGGCGCT ACAGCTGCCC
ATCAACGTGA TGGCACGCCT TCCCCGTGCC GGCGCCTGA
 
Protein sequence
MTFSQFLSIL KARWIAALLV LVLTVGTTIG VSLMLPKNYT ASAAVVLDVR SPDPIAGMVL 
GAMAMPAYMA TQVDIIQSDR VAQRVVQGLR LTENPETRQQ WQDATGGKGN FEAWLADLLK
KKLDVKPSRE SNVINIGYTS PDPRFAAALA NAFVRSYMDV SIGLRVSPAK QYNEFFDARG
KELREALEQA QAKLTTYQKT SGILATDERF DVENQRLNEL SSQLVALQAL SAESTSRSAQ
ARNQADQLGD VINNPVVAGL RADLSRQEAR LMEMNSKLGD AHPQVVELRA NIAELRQRID
GETRRVSGSV GINNTINKAR EGEVRAALEA QRAKVLALKQ QRDEALVLMK EVETAQRAYD
QVVARASQTN LESQNTQTNI SVLTPATEPA DHSSPKLLLN ALLSVFLGTL LAVGFALVRE
LMDRRVRTVE DLAEGLGLPV LGALPKPMRG SARSPALQLP INVMARLPRA GA
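
As a rough cross-check of the two sequences above, the sketch below computes a GC fraction and translates a nucleotide string into protein. It is a minimal illustration, not the method used to produce this record. It assumes the sequence is given in coding orientation, uses the codon-to-amino-acid assignments of translation table 11 (which match the standard code), and does not handle alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCTTCT CCCAGTTCCT CTCGATCCTC"
print(translate(first_blocks))  # MTFSQFLSIL
print(round(gc_fraction(first_blocks), 2))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions allows the whole translation and the reported GC content to be compared against the record.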