Gene Lcho_1062 details

Gene Information

Locus tag: Lcho_1062
Symbol:
ID: 6162787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 1130970
End bp: 1133774
Gene Length: 2805 bp
Protein Length: 934 aa
Translation table: 11
GC content: 70%
IMG OID: 641663816
Product: DNA polymerase I
Protein accession: YP_001790096
Protein GI: 171057747
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
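
The start and end coordinates, the gene length, and the protein length in this table are mutually consistent, which a short calculation can confirm. The sketch below assumes that coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1130970
end_bp = 1133774

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2805
print(protein_length)  # 934
```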


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATCCAG ACCGCGCCCG CCCCACCCTC CTGCTGGTCG ACGGCTCCAG CTACCTCTAC 
CGCGCCTACC ACGCGATGCC CGACCTGCGC GGCCCCGACG GCCAGCCCAC CGGTGCCATC
CACGGCCTGG TGGCCATGCT CAAGCTGCTG CGCAGCCAGA TCGGCGCCGA ACACGCGGCC
TGCGTCTTCG ACGCCAAGGG CCCGACCTTC CGCGACGACT GGTACCCGCT CTACAAGGCC
CAGCGCCCGC CGATGCCCGA CGACCTGCGG CTGCAGATCG AGCCGATCCA CGAGGTCGTC
CGGCTGCTCG GCTGGCCGGT GCTCGAGGTG CCCGGCATCG AGGCCGACGA CGTCATCGGC
ACGCTCGCCC AGCTGGCCGA AAAGTCCGGC CACCGGGTGG TGATCTCCAC CGGCGACAAG
GACCTGGCGC AGCTCGTCAC CGAGCACGTC ACGCTGATCA ACACCATGAG CCGCGAGTCG
CTCGACATCG AGGGCGTCAA GGCCAAGTTC GGCGTGCCGC CGGACCGCAT CGTCGACTAC
CTGACGCTGA TGGGCGACAC GGTCGACAAC GTGCCCGGCG TCGAGAAGGT CGGCCCCAAG
ACCGCCGCCA AGTGGATCGC CGAACACGGC TCGCTCGACG GCGTGGTGGC CGCCGCCGGC
TCGATCAAGG GCGTGGCCGG CGAGAACCTG CGCCGCGCGC TCGACTGGCT GCCGACCGCG
CGCAAGCTCG TCACCGTCAA GACCGACTGC GACCTGAGCG GCCACGTGCC CGCGTGGCCG
GCGCTCGAGG CGCTGGCGCT GCGCGAGGTC GATCCCGACG GCCTGAGGGC GTTCTACATG
CGCAACGGCT TCAGGACCTG GACCCGCGAG CTGGAGGCGC AGGCCCCGAG CGCCCCCGTC
ATCGCTTCTG CCGCAGCCCC CGCGCTCGAC GACATGGCGC AGGCCTCGGG CAACGGCGAT
GCGCCGGCCC AGGCTCCGGC GCCGGCCGCC GATGGTGCGG CGGGTCGCTA CGAAACCATC
TTCACCACCG AGCAGCTCGA TGCCTGGATC GCGCGACTGC AGGCCGCGCC GCTGGTGGCG
ATCGACACCG AGACCGACTC GCTCGACCCG ATGCGTGCCC GCATCATCGG CATCAGCTTC
GCGGTGCAAC CGCTGGAGGC GGCCTACGTG CCGGTCGGCC ACGACTACCC CGGCGCGCCC
GACCAGCTGC CGCTCGACGA GGTGCTGGCG CGCCTGCGTC CCTGGCTCGA AGACGCCGGC
ACGCGCAAGG TCGGCCAGAA CATCAAGTAC GACAGCCACG TCTTCGCCAA CCACGGCGTC
ACGGTGCGCG GCTACGCCCA CGACACCATG CTCGAAAGCT ACGTGCTCGA AGCCCACAAG
CCGCACGGCC TGGCCAGCCT GGCCGACCGC CACCTCGGCC GCAGCGGCAT CAACTACGAA
GACCTGTGCG GCAAGGGCGC CGGCCAGATC CCGTTCAGCC ACGTGGCGGT CGACCAGGCC
AGCACCTACT CGGGCGAAGA CAGCGAGATG ACGCTGCAGG TGCACCAGAC GCTGTGGCCG
CAGCTCGAAG CCGAACCGCG CCTGCGCGAG CTCTACGAAA CCATCGAGAT GCCGAGCGCC
GAGGTGCTGG TGCGCATCGA GCGCAACGGC GTGCTGATCG ACGCGGGCGT GCTGGCGCGC
CAGAGCCACC AGCTCGCGCA GCGCATGCAC ACGCTCGAAC AGGAGGCCCA TGCGATTGCC
GGCCAGCCCT TCAACATGAG CAGCCCCAAG CAGATCGGCG AGATCCTGTT CACCAAGCTC
GGCCTGCCGG TCAAGAAGAA GACCGCCAGC GGCGCGCCCA GCACCGACGA GGAAGTGCTG
CAGGAACTCG CCGCCGACTA CCCGCTGCCG GCGCGCATCC TCGAACACCG CAGCCTGGCC
AAGCTCAAGG GCACCTACAC CGACAAGCTG CCGCTGATGG TCAACCCGTC GACCGGCCGC
GTGCACACCA ATTACGCGCA AGCCGTGGCG ATCACCGGGC GGCTGTCGAG CAACGATCCG
AACCTGCAGA ACATCCCCAT CCGCACCGCC GAGGGCCGGC GCGTGCGCGA GGCCTTCGTC
GCCGCGCCCG GCCACAAGCT GGTCAGCGCC GACTACTCGC AGATCGAGCT GCGGCTGATG
GCGCACATCT CGGGCGACGC CAACCTGCTG AAGGCCTTCG CCGACGGCAT GGACGTGCAC
CGCGCCACCG CCGCCGAGGT CTTCAACATC GCCGCCGCCG ACGTCACCAG CGAGCAGCGC
CGCTACGCCA AGACCATCAA CTTCGGCCTG ATCTACGGCA TGGGCGCGTT CGGCCTGGCG
GCCAGCCTGG GCATCGAGCA GAAGGCGGCG CGTGACTACA TCGAGCGCTA CTTCGCGCGT
TACCCGGACG TCAAACGCTA CATGGACGAG ACCAAGGCCG GCGCGGCGCA GCTCGGTCAT
GTCGAGACGC TGTTCGGCCG CCGCATCGTG CTGCCCGAGA TCAAGGGCGG CAACGGCCCG
CGCAAGGCCG CCGCCGAACG CCAGGCCATC AACGCGCCGA TGCAGGGCAC CGCGGCCGAC
CTGATCAAGC TGGCGATGAT CGCGGTGCAG AAGGCCATCG ACGACGAAGG CCGCGCCAGC
AAGATGATCA TGCAGGTGCA CGACGAACTG GTGCTCGAGG TGCCCGAGGC CGAACTCGGC
TGGGCGCGTG AGGCGCTGCC GCGGCTGATG GCCGGCGTGG CGCAGCTCAA GGTGCCGCTG
GTGGCCGAGG TGGGCGAAGG GGCCAATTGG GAGGAAGCCC ATTGA
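
The GC content reported in the Gene Information table can be recomputed from a listing like the one above. This is a minimal sketch, assuming the sequence is pasted in as text with the 10-base blocks separated by whitespace; `gc_fraction` is a hypothetical helper name, not something the source database provides.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# The first two 10-base blocks of the gene sequence: 13 of 20 bases are G or C.
example = "ATGAATCCAG ACCGCGCCCG"
print(f"{gc_fraction(example):.0%}")  # 65%
```

Passing the full gene sequence to the same function gives the whole-gene value to compare against the table.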
 
Protein sequence
MNPDRARPTL LLVDGSSYLY RAYHAMPDLR GPDGQPTGAI HGLVAMLKLL RSQIGAEHAA 
CVFDAKGPTF RDDWYPLYKA QRPPMPDDLR LQIEPIHEVV RLLGWPVLEV PGIEADDVIG
TLAQLAEKSG HRVVISTGDK DLAQLVTEHV TLINTMSRES LDIEGVKAKF GVPPDRIVDY
LTLMGDTVDN VPGVEKVGPK TAAKWIAEHG SLDGVVAAAG SIKGVAGENL RRALDWLPTA
RKLVTVKTDC DLSGHVPAWP ALEALALREV DPDGLRAFYM RNGFRTWTRE LEAQAPSAPV
IASAAAPALD DMAQASGNGD APAQAPAPAA DGAAGRYETI FTTEQLDAWI ARLQAAPLVA
IDTETDSLDP MRARIIGISF AVQPLEAAYV PVGHDYPGAP DQLPLDEVLA RLRPWLEDAG
TRKVGQNIKY DSHVFANHGV TVRGYAHDTM LESYVLEAHK PHGLASLADR HLGRSGINYE
DLCGKGAGQI PFSHVAVDQA STYSGEDSEM TLQVHQTLWP QLEAEPRLRE LYETIEMPSA
EVLVRIERNG VLIDAGVLAR QSHQLAQRMH TLEQEAHAIA GQPFNMSSPK QIGEILFTKL
GLPVKKKTAS GAPSTDEEVL QELAADYPLP ARILEHRSLA KLKGTYTDKL PLMVNPSTGR
VHTNYAQAVA ITGRLSSNDP NLQNIPIRTA EGRRVREAFV AAPGHKLVSA DYSQIELRLM
AHISGDANLL KAFADGMDVH RATAAEVFNI AAADVTSEQR RYAKTINFGL IYGMGAFGLA
ASLGIEQKAA RDYIERYFAR YPDVKRYMDE TKAGAAQLGH VETLFGRRIV LPEIKGGNGP
RKAAAERQAI NAPMQGTAAD LIKLAMIAVQ KAIDDEGRAS KMIMQVHDEL VLEVPEAELG
WAREALPRLM AGVAQLKVPL VAEVGEGANW EEAH
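
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship in plain Python, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Compact codon table: bases in TCAG order, one amino-acid letter per codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAATCCAG ACCGCGCCCG CCCCACCCTC"))  # MNPDRARPTL
# Last 15 bases of the gene give the final 4 residues, then the TGA stop.
print(translate("GAGGAAGCCC ATTGA"))  # EEAH
```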