Gene Lcho_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1954 
Symbol 
ID6162036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2117347 
End bp2118924 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content70% 
IMG OID641664723 
Productprotein of unknown function DUF853 NPT hydrolase putative 
Protein accessionYP_001790986 
Protein GI171058637 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAC CCATCCTGCT TGCCCAACAC GGCGAGATCG AATGCCACAT GCTGCCGGCG 
CTGGCCAACC GTCACGGCCT GATCACCGGC GCCACCGGCA CCGGCAAGAC CATCACGCTG
CAGAAGCTGG CCGAGAGCTT CTCGCTGATC GGCGTGCCGG TCTTCATGGC CGACGTCAAG
GGCGACCTGA CCGGCATCAC GCAGACCGGC AGCGCCAACG CCAGGCTGGC CAAGGTGCTG
GCCGAACGCG GCTTGCCCGA GCCGGCCTGG GGCGCCTGCC CGGCCACGCT GTGGGACGTC
TTCGGCGAGC AGGGCCACCC GGTGCGCGCC ACCGTGTCCG ACATGGGCCC GCTGCTGCTG
GCGCGCATGC TCAACCTCAA CGAGACGCAG CAGGGCGTGC TGTCGATGGC GTTTCGCATC
GCCGACGACA ACGGCCTGCT GCTGCTCGAC CTGAAGGATC TGCGCGCCAT GCTGCAGTAC
CTGGGCGAGA ACGCCAGCGA GTTCACGACC GAATACGGCA ACATCAGCGC CGCCTCGGTG
GGCGCGATCC AGCGCGGCCT GCTGCAGATC GAGGAGCAAG GCGGCGACAG GTTCTTCGGC
GAGCCGATGC TCGCGATCGA CGACTTCATG CAGACCGTCG ACGGCCGCGG TGTCATCAAC
ATCCTGGCCG CTGACAAGCT GATGAACGCG CCGCGGCTCT ACGCCACCTT CCTGCTGTGG
ATGCTGTCGG AGCTGTTCGA ACTGCTGCCC GAGGTCGGCG ACCTCGAGAA GCCCAAGCTC
GCGTTCTTCT TCGACGAGGC GCACCTGCTG TTCAAGGACG CGCCGGCCGC GCTGGTCGAG
CGCATCGAGC TGGTGGTGCG GCTGGTGCGC TCCAAGGGCG TGGGCGTCTA TTTCGTGACG
CAGAACCCGC TCGACATCCC CGACAGCGTG CTCGGCCAGC TCGGCAACCG CATCCAGCAC
GCGCTGCGCG CCTTCACGCC GCGTGACCAG AAGGCGGTCA AGGCGGCGGC CGAGACCATG
CGCGCCAACC CCGGGCTCGA CGTCGCCAGC GCGATCACCG AGCTGGCGGT CGGCGAGGCG
CTGGTCAGCC TGCTCGACGA GAAGGGCCGC CCGGGCGTGA CCCAGCGCGT CTTCGTGCTG
CCGCCGGGCA GCCAGATCGG CCCGATCGAT GCCGAGCAGC GCAAGCGCCT GCTCGCCGAA
TCGCTGGTCG CGGGGGTCTA CGAAAAGACC ATCGACCGCG AATCGGCGCA TGAAAAGCTC
AAGGGCCGCG CCGCGCAATC CGTCGAAGCG GGCGAACACA AGCGTGACGG TGGCGGCCTG
GCCGGCCGAG GCGCCGGTGC GGGCGAGGCC GCCCCCGGTG CCGCCGACGA GAGCGGCGGC
GGCATGGCGG GCGCGCTGAT GAGCGGCCTG GGCGGCTTGC TGTTCGGCTC GACCGGGCCG
CGCGGCGGCC GCCATGACGG CCTGGCGCAG ACCATGGCCA AGTCGGCGGT GCGCTCGGTC
GGCTCGGCGG TCGGCCGCGA GATCATCCGC GGCGTGCTGG GCTCGCTGCT GGGCGGCGGC
TCGTCACGGC GTCGCTGA
 
Protein sequence
MAEPILLAQH GEIECHMLPA LANRHGLITG ATGTGKTITL QKLAESFSLI GVPVFMADVK 
GDLTGITQTG SANARLAKVL AERGLPEPAW GACPATLWDV FGEQGHPVRA TVSDMGPLLL
ARMLNLNETQ QGVLSMAFRI ADDNGLLLLD LKDLRAMLQY LGENASEFTT EYGNISAASV
GAIQRGLLQI EEQGGDRFFG EPMLAIDDFM QTVDGRGVIN ILAADKLMNA PRLYATFLLW
MLSELFELLP EVGDLEKPKL AFFFDEAHLL FKDAPAALVE RIELVVRLVR SKGVGVYFVT
QNPLDIPDSV LGQLGNRIQH ALRAFTPRDQ KAVKAAAETM RANPGLDVAS AITELAVGEA
LVSLLDEKGR PGVTQRVFVL PPGSQIGPID AEQRKRLLAE SLVAGVYEKT IDRESAHEKL
KGRAAQSVEA GEHKRDGGGL AGRGAGAGEA APGAADESGG GMAGALMSGL GGLLFGSTGP
RGGRHDGLAQ TMAKSAVRSV GSAVGREIIR GVLGSLLGGG SSRRR