Gene Lcho_1311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1311 
Symbol 
ID6163682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1398065 
End bp1399801 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content58% 
IMG OID641664066 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001790345 
Protein GI171057996 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGG TGCTGGAGGC GCGTGAACCG CGCGCGAGCT ACCTGGCCGC GTTGCAGCCG 
CCCCTGGTGC GGCAGTTCGA TCTGCTGGCC ACTGCGCCCG GTGGCGTGGC GCGACTGCGC
GAATTGATCC TGACGCTGGC GGTGCAGGGC AAGCTGGTGC CGCAGGATGC GAGCGACGAG
CCGGCGAGTG TGCTGCTGCA GAAGATTCGG GCGGAGAAGA ATCGGTTGAT TGCGGCGAAG
GAAATCAAGC GAGATAAGCC GCTGGCACCG ATAAGCGATG AAGAAATATC GTTTGATGCC
CCACGTGGGT GGGAATTGGT TCGTTTGGGC GACTTGGTGA ATGCCAGTGA AGCGGGCTGG
AGTCCAAGTT GTGCTGGTTC TCCACGCCGT GCCGGGCATT GGGGTGTATT GAAAGTCAGC
GCAGTTTCTT GGGGGAAATT TGATCCCGAA GCCAACAAAG AATTGCCCGC TGATCTGCAG
CCGAAACCAG AATACGAAGT GCGGTCCGGC GATTTTTTGC TATCTCGGGC CAACACGGAA
GAACTTGTCG CGCGTTCAGT TGTCGTTGGT GCTGTCGATC CGCGTTTGAT GCTCAGCGAC
AAGATCATTC GGCTGGACGT TGCAAATCCA ATTCATCGCG GTTTCCTCAA CTTTTGCAAC
AACGAAAAAA GTGCTCGTAC CCATTACGCA GCTAATGCGT CAGGCACAAG CAGTTCGATG
AAAAACGTTT CGCGCGAGGT TGTTCTGAAT CTTCCAATCG CGTTGCCACC CCTCGCCGAA
CAATCCCGCA TCGTCACCCG CGTCGAAGAA CTGATGCGGC TGTGCGATGC GCTCGAAAGC
CAACGCCAGC TCGAAACCGC GCAGCACGCC CAACTGCTGA ACACCCTGCT GGGCACGCTG
ACCGACAGCG CCTCGCCCGA CGAACTGGCC GCCAACTGGC AGCGCGTGAG CGATCACTTC
GACCTGTTGC TCGACCGCCC CCAAGCCGTG GACGCGCTGG AGCAGACCAT CCTGCAACTC
GCCGTGCGCG GGCTGCTGGT GCCGCAAGAC CCGACCGACG AGCCGGCGAG TGTGTTGCTG
CAGAAGATTC GGGCGGAGAA GGATCGGTTG ATTGCGGCGC GCAAGATCAA GCGGGACAAG
GCGTTGCCTA TTATTTCTGA CAAAGATGGA CTCGATGATC TGCCAGAAGG ATGGGTCGTG
GTTCGGCTGG GTGCAATTAT GGAGCTTGTA TCTGGTCAGC ATCTTGGTCC CGCTGAGTAT
GCAGAGGGTC TTGACTCTGG GATTCCATAT TTAACCGGCC CAGCTGAATT TGGTCCCCAA
TCGCCCAGCC CAACCAGATC AACTGTAGAG CGGCGCGCCA TTGCGATTTG GGGGGATATT
CTCATCACAG TTAAGGGTTC CGGCGTTGGA AAACTCAATG TGGTTGCGCA TTCGGAGATT
GCGATAAGCC GCCAATTGAT GGCGGTTCGC TCGATTGGCG TGAATGATGC GTTCCTCTTT
ATTGTGCTCA AAACGCTTGA GATTAAGTTC CAGATGCAGT CAGTCGGCAT AGCTATTCCT
GGCATTGGTA GAGAGGATGT TTCTCACTCA ATTCTTGGCC TACCACCCCT CGCTGAACAA
GCCCGCATCG TCGCCCGCGT CACCCAACTC CGCAGCCACT GCGCCGACCT GCGCCAACGC
CTGTCAGCCC GTCAGGCCAT CCAAAGCCAT CTGGCCGAGG CGCTGGTGGA GGTTTGA
 
Protein sequence
MSAVLEAREP RASYLAALQP PLVRQFDLLA TAPGGVARLR ELILTLAVQG KLVPQDASDE 
PASVLLQKIR AEKNRLIAAK EIKRDKPLAP ISDEEISFDA PRGWELVRLG DLVNASEAGW
SPSCAGSPRR AGHWGVLKVS AVSWGKFDPE ANKELPADLQ PKPEYEVRSG DFLLSRANTE
ELVARSVVVG AVDPRLMLSD KIIRLDVANP IHRGFLNFCN NEKSARTHYA ANASGTSSSM
KNVSREVVLN LPIALPPLAE QSRIVTRVEE LMRLCDALES QRQLETAQHA QLLNTLLGTL
TDSASPDELA ANWQRVSDHF DLLLDRPQAV DALEQTILQL AVRGLLVPQD PTDEPASVLL
QKIRAEKDRL IAARKIKRDK ALPIISDKDG LDDLPEGWVV VRLGAIMELV SGQHLGPAEY
AEGLDSGIPY LTGPAEFGPQ SPSPTRSTVE RRAIAIWGDI LITVKGSGVG KLNVVAHSEI
AISRQLMAVR SIGVNDAFLF IVLKTLEIKF QMQSVGIAIP GIGREDVSHS ILGLPPLAEQ
ARIVARVTQL RSHCADLRQR LSARQAIQSH LAEALVEV