Gene Plim_3373 details

Gene Information

Locus tag: Plim_3373
Symbol:
ID: 9140089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4367008
End bp: 4368348
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: PhoH family protein
Protein accession: YP_003631385
Protein GI: 296123607
COG category:
COG ID:
TIGRFAM ID:
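
The length fields above follow from the coordinates: with inclusive 1-based coordinates, gene length is End bp minus Start bp plus one, and a complete CDS of that length encodes one residue per codon, minus the stop codon. The sketch below checks that arithmetic; the function names are illustrative and not part of any database API, and the inclusive-coordinate convention is an assumption that happens to match the reported values.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS: one per codon, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4367008, 4368348)
print(length)                              # 1341
print(expected_protein_length_aa(length))  # 446
```

Both results agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields in the record.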


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.349535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACTGG AGCACAGTTC GACGCAAGGC AAGCTCTTCG TTCTCGACAC CAACGTCATC 
CTTCACGATG CGGGGTGTTT ATTTAACTTC GAAGAGAATG ACATCGCCAT ACCGATCACT
GTGCTGGAGG AACTCGATCG ATTCAAAAAA GGTAACGACG ACATCAACTT TCAGGCCCGT
GCGTTCCTCC GACGATTGGA CGAACTCGCG GGCGATGTTC TTTCTGCAGA TGGCGCAGCC
TTAGGCGATG GCTTAGGCCG CATTCGTGTG GTGCTGCGTG GCCACTTTAC CGCCCGCATG
CGGGAAACTT TTCTCTCCGA CGGCCCCGAT CATCGCATTC TCGATGCCGC ACTGACTCTG
CAGGAAACGT CAGCTCCCCA GCCCGTTATC CTGGTTTCAA AAGACACCAA CCTGCGGATG
AAGGCCAAAT CCCTCGGGCT GCCCGCCGAA GATTATTCGA CCGATAAGGT CGAAAGCTTC
GACAAGCTCT ACACCGGAAA GCGGCTCGTC ACGAACATGC CTTGCGAAAG TGTTTCCGCC
TTCTATGCCG AAGGTGGCCG CGTGTCAGCA GAAAGCCTGC CGGAAGTAAC CACACCTCGC
GCTAACGAGA ACTTCATTCT GCGGAATGGT TCGCGCTCGG TCTTGGCCAT GTACAACGCA
GAAGAAAATG CATTCCACCG TGTGGAACGA ACCACCGCAC TGGGCGTGGT GCCTCGCAAT
GCCGAGCAGC ACTTCGCCTT GCGGGCACTT CTGGATGATG ACATCAAGCT GGTGACGATT
GCCGGCAAAG CCGGTTCGGG GAAAACGTTA CTCGCCCTCG CAGCGGCTCT CGAATGCCGG
AGCAACTACC GCCAGATTCT GCTCGCCCGG CCTGTCGTGC CACTTTCGAA TAAAGACCTG
GGCTATTTGC CGGGGGATGT GCATGCCAAG CTCGACCCTT ACATGCAGCC ACTCTTTGAT
AACCTTTCGG TCATCAAGCA TCAGAACCAT GAAGGCGATA CGGCGAAACT CGTCCAGCAG
ATGCAGGAAG ATCATCGGCT CGAAATTACG CCACTAGCGT ATATCCGCGG CCGCAGTTTG
CAGCGGGTCT TCTTCATTGT CGATGAAGCG CAGAACCTGA CACCGCACGA AGTGAAGACC
ATTATCACGC GGGCAGGCGA AGGAACAAAG ATCGTCCTGA CGGGCGATAT CCACCAGATC
GACCATCCCT ACCTCGATTC GCTCTCGAAC GGGCTGTCGT ACCTCATCAA CCGCATGGTC
GGCCAGAAGC TCTACGCCCA CGTGACGCTC GAAAAAGGCG AACGCTCCCA ACTCGCCGAA
CTCGCCACCG ATTTGCTTTA G
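
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. A minimal sketch of how the GC content field could be recomputed from the block above is shown here; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short string in the usage lines is only the first row of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, as printed in the record.
first_row = "ATGCGACTGG AGCACAGTTC GACGCAAGGC AAGCTCTTCG TTCTCGACAC CAACGTCATC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full 1341 bp sequence should give a value comparable to the rounded GC content reported in the Gene Information section.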
 
Protein sequence
MRLEHSSTQG KLFVLDTNVI LHDAGCLFNF EENDIAIPIT VLEELDRFKK GNDDINFQAR 
AFLRRLDELA GDVLSADGAA LGDGLGRIRV VLRGHFTARM RETFLSDGPD HRILDAALTL
QETSAPQPVI LVSKDTNLRM KAKSLGLPAE DYSTDKVESF DKLYTGKRLV TNMPCESVSA
FYAEGGRVSA ESLPEVTTPR ANENFILRNG SRSVLAMYNA EENAFHRVER TTALGVVPRN
AEQHFALRAL LDDDIKLVTI AGKAGSGKTL LALAAALECR SNYRQILLAR PVVPLSNKDL
GYLPGDVHAK LDPYMQPLFD NLSVIKHQNH EGDTAKLVQQ MQEDHRLEIT PLAYIRGRSL
QRVFFIVDEA QNLTPHEVKT IITRAGEGTK IVLTGDIHQI DHPYLDSLSN GLSYLINRMV
GQKLYAHVTL EKGERSQLAE LATDLL
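
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below uses only the standard library and assumes that translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code, differing only in permitted start codons; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCGACTGG AGCACAGTTC GACGCAAGGC"))  # MRLEHSSTQG
print(translate("GATTTGCTTTAG"))                      # DLL
```

The two fragments reproduce the first ten and last three residues of the protein sequence above; translating the full gene sequence the same way allows a residue-by-residue comparison.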