Gene Plim_3322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3322 
Symbol 
ID9140037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4295331 
End bp4296737 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content55% 
IMG OID 
Productmetal-dependent phosphohydrolase HD sub domain protein 
Protein accessionYP_003631334 
Protein GI296123556 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGAA ACTTCATCAG CGAAAGCCTC TCCCACGATC CGATTCACGG TTATATTCCG 
TTCATTTCGC GTGGGGGTCT GCCAGCGGGA GAGACAGCTG AGCAGGATAT CATTGATCAT
CCGTGGGTGC AGAGGCTCCG GCATATTCAT CAGTTACAGA CGGCCTGGTG GGTTTTTCCG
GCGGCTGAGC ACATGCGGTT TCAGCATGTG ATGGGGGCCA TGCATCTGGC CTCTGTGGCC
ATTGACTACT GGTACGATTC ACTGTGCGAT GCGTGCCGCA ATGTCCCTTC GCGGCCTTAT
GTCGAATCGC TATTGCGCAT GGCGGCTCTG CTCCACGACG TAGGGCATGG GCCGTTCGGC
CATTTCTTTG ATGATCATTA CCTGGCACAG TTCGGTGTCA CTCACGAAGA TGTGGGAGGC
GTGATTATCG AGCAGGAACT GGGAGAACTG CTGCGGGGGA TCCGGCGGAA TCCCAAAGGA
GAACTCAAGC CCCTGGAAGA ACTCGATCCC CGGCAGATTT CGTGGCTGAT CCGCCGGCCC
AGGCCAGGTT CCCCCGATGA CGAAGGCCAT CCCGACTGGC TCAAAAAACT GCGGGCCATG
TTCAGCGGTA TTTATACTGT AGACAACATG GATTTCGTCT TGCGCGATGC CTACATGACG
GGCTATAACG TGCGGGCGTT TGATATCTCC CGGTTGCTCC ACTATTCGTT CTTTACTCCT
CAGGGCTTAA CGATTCATAT TCGTGGCTTG CCCACATTGA TCAATTTCAT CGAAACCCGT
GCCAACCTCT TTCGCACCGT CTACTTTCAT CGCACTGTGC GGGCCATCGA TCTGGCACTG
GAAGATCTGT TTCCCGAAAC GATGCCTCAC CTGTTCCAGG GGAGCCCGAT TGGTCAGTTG
GAAAACTACC GCCGGTTGAC GGAATCGTCA TTTCTGGTCG ATGTGGATCG CTTCTCGCAA
TCAACCGATC CTGCGACACG CCAGTTGGGC GAGCGCTGGC AGAAGATTCT TTCGCGCGAA
GTCCATTGGA AGATGGCGAC AGAGCGCACC ATGGGGTACT ACACCAGCAG TGCCGAGCGG
ATGTCGATCT TCTCGGAACC TGACCTCATC GAGCAGCGCT TGCGCGGCCG GTTGCCAGAG
GATAAGAAGT CGATTGAAAT GCGAATCGAT GTGGCCCGGC ATTATCATCG GCCCAGTGGC
AGGCTCCCAG TGGGTGGTCA AAATCATCTG TTTGATCCAG CGGCGAATGG GATCACAGAA
CTGTCGGATG ATGAACTTTT CCGGCAGTTG CCACTGAGTC TGTCGGTATT CCGCGTGTAC
AGTCGGGATC ACGCGAACGA TAAGGCCATT ACGATGGCTT TGCAGAGCCT GCTGGGCGAT
GCCGGCGACG CCAAGACAAA TATGTAA
 
Protein sequence
MSRNFISESL SHDPIHGYIP FISRGGLPAG ETAEQDIIDH PWVQRLRHIH QLQTAWWVFP 
AAEHMRFQHV MGAMHLASVA IDYWYDSLCD ACRNVPSRPY VESLLRMAAL LHDVGHGPFG
HFFDDHYLAQ FGVTHEDVGG VIIEQELGEL LRGIRRNPKG ELKPLEELDP RQISWLIRRP
RPGSPDDEGH PDWLKKLRAM FSGIYTVDNM DFVLRDAYMT GYNVRAFDIS RLLHYSFFTP
QGLTIHIRGL PTLINFIETR ANLFRTVYFH RTVRAIDLAL EDLFPETMPH LFQGSPIGQL
ENYRRLTESS FLVDVDRFSQ STDPATRQLG ERWQKILSRE VHWKMATERT MGYYTSSAER
MSIFSEPDLI EQRLRGRLPE DKKSIEMRID VARHYHRPSG RLPVGGQNHL FDPAANGITE
LSDDELFRQL PLSLSVFRVY SRDHANDKAI TMALQSLLGD AGDAKTNM