Gene Plim_1742 details

Gene Information

Locus tag: Plim_1742
Symbol:
ID: 9138443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2263322
End bp: 2265175
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003629771
Protein GI: 296121993
COG category:
COG ID:
TIGRFAM ID:
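
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the stated 1854 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2263322, 2265175)
print(length)                  # 1854
print(protein_length(length))  # 617
```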


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.780857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAACA TCGCGCATTC GAACTCGATG GAACTTCCTC CACTGGGCAA TAACCCTTCC 
ACTCATCATG CCGGTACAAC GCCAGGCAGC TTCGCCAACC CCCCACGAGT CGCACCAGGT
GTCGAAGGTC GTTGGACTTT TTCCTCGCCC GACACACCGA ACATGCCGCA GCCTTCTGAA
AAGACGGCTT GGGATTTCCT CCCGGAGGGT TGGTCAACGG TGGAACTCGA AGAGGCAGCC
GAACTCTTCG ATTACACATC GGGTGCCACG CGGGTTGTCT TTCTGCAAAG TGACGACAGC
CAGCGTAAAG TCTGTTTGCC CGCCGGCTTC GAACCCTCGA CACAGCTTGA ATGGGCTCGT
GTGGGTGTGA TCACCCCGCA GATGGTGCGA GTTGCTGAAC GTGAAGATCA TCTCATGCCT
GCCCAGGTTC GTGACGAGAT CGCCGCTGGC CGGTTGGTGA TTCCCGCCAA CAAGCACCAC
CTCAAATACC AGCTCGATCC CATGGCCATT GGCCGGGCGA CGAAGACCAA GATCAATGCC
AACATGGGGG CCTCCCCCGT CTCCTCCAGC ACCGACGAGG AAGTCGAAAA GCTGAAGTGG
GCCGAACGCT GGGGTGCGGA TACGGTGATG GATCTTTCCA CCGGTGGCGA CCTCAACGCC
TGCCGCGTGG CGATCGTGCA GAACAGCACG GTCCCCATCG GGACGGTTCC CATCTATTCA
ATGATTATTG GGCGAAAGAT TGAAGAGCTC TCGCACGAGA TCATCCTTGA AAGTCTCGAA
CAACAGGCCC AACAAGGGGT CGATTACTTC ACGATCCATG CCGGTGTCTT GCGGGAACAT
CTTCCATTTG TGGTCAAGCG GCTCATCGGG ATTGTCAGCC GGGGTGGTTC GCTCCTCGCC
CAGTGGATGA TCCGCAACAG CGGCCAGAAT CCGATGTACG ATCGCTGGGA AGACATCTGC
GACATCATGC GCAAGCACGA TGTCACCTTC TCGATCGGCG ACGGCCTGCG TCCCGGCGGA
TTGGCCGATG CCACCGACCG CGCTCAACTC GCCGAACTGG CGACATTGGG TGAATTGACC
GAGCGCGCCT GGCGGAAGGG CGTGCAAGTC ATGATCGAAG GGCCAGGCCA CGTCCCTTTC
GACCAGATCG AATACAACAT GAAGCTCCAG CGGACGCTCT GCCACGGTGC CCCGTTCTAT
GTCCTTGGGC CGCTGGTGAC AGATATCTTC CCCGGCTATG ACCATATCAC CAGTTGTATT
GGTGCGACTG CCGCTGCCTA TCACGGCGCG AGCATGCTCT GCTATGTGAC CCCCAAGGAG
CACCTGGGCC TGCCCAAGAA AGACGACGTC AAGCAGGGCT GCATTGCCTA TAAGATTGCG
GCTCATGCGG CCGATGTGGC CCTCGGCATT CCCGGCACTC GCGACCGCGA CGACGAACTG
ACCAAGGCTC GCGCTGCCCT CAACTGGGAG AAGCACTTCG AGCTGAGCTT CGACCCCGAT
ACGGCCCGTG CCTATCACGA CGAGGACCTC GACGTCGACA CCGACTTCTG CGCCATGTGT
GGCCACGACT GGTGCAGCGT CCGCATCTCG AAGGAGATCG TCGAATTTGC TTCAGGTAAG
GACGAGAACT ACCAGTGGAA CCGCGCCAAG GTTTCTGCTG CTCTCACGCC CGAGCAGCAG
GAAATCCTCG AAAAGCGCGG TCACCTCTCC CCGCAGGAGA TCCATCAACT CGCCAGCAAG
ACCAAAAAGG TCGTTGGTGC CAATAAAGAC GCCAAAGCGG CCTGCCACAG CGACGTGGTC
GATGCAGAAA GTGCTAAGCA AATCCAGGTC GAACGACTGA GTTCGGCAAC GTGA
 
Protein sequence
MINIAHSNSM ELPPLGNNPS THHAGTTPGS FANPPRVAPG VEGRWTFSSP DTPNMPQPSE 
KTAWDFLPEG WSTVELEEAA ELFDYTSGAT RVVFLQSDDS QRKVCLPAGF EPSTQLEWAR
VGVITPQMVR VAEREDHLMP AQVRDEIAAG RLVIPANKHH LKYQLDPMAI GRATKTKINA
NMGASPVSSS TDEEVEKLKW AERWGADTVM DLSTGGDLNA CRVAIVQNST VPIGTVPIYS
MIIGRKIEEL SHEIILESLE QQAQQGVDYF TIHAGVLREH LPFVVKRLIG IVSRGGSLLA
QWMIRNSGQN PMYDRWEDIC DIMRKHDVTF SIGDGLRPGG LADATDRAQL AELATLGELT
ERAWRKGVQV MIEGPGHVPF DQIEYNMKLQ RTLCHGAPFY VLGPLVTDIF PGYDHITSCI
GATAAAYHGA SMLCYVTPKE HLGLPKKDDV KQGCIAYKIA AHAADVALGI PGTRDRDDEL
TKARAALNWE KHFELSFDPD TARAYHDEDL DVDTDFCAMC GHDWCSVRIS KEIVEFASGK
DENYQWNRAK VSAALTPEQQ EILEKRGHLS PQEIHQLASK TKKVVGANKD AKAACHSDVV
DAESAKQIQV ERLSSAT
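
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. It assumes the standard codon assignments, which translation table 11 shares for all sense and stop codons (the two tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGATCAACA TCGCGCATTC GAACTCGATG"))  # MINIAHSNSM
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.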