Gene Plim_3972 details

Gene Information

Locus tag: Plim_3972
Symbol:
ID: 9140692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5094540
End bp: 5095937
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: protein of unknown function DUF1501
Protein accession: YP_003631982
Protein GI: 296124204
COG category:
COG ID:
TIGRFAM ID:
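
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; both assumptions agree with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5094540, 5095937)
print(length, protein_length_aa(length))  # prints: 1398 465
```

The result matches the Gene Length (1398 bp) and Protein Length (465 aa) fields.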


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATC ATCTACAGAA ATTGTTGAGT CAGAAACTTC CTCGCCGTCA GTTATTAACG 
GCCGGTGGCA TGGCTGGTTT TGGGTTGACC TTACCTCGGT GGCTGGCCGC TCAGGATCAA
GCGGCTGCTG AGTTACCGGC CGCCATGTCG ACGGCGAAGT CTGTCATCTT TCTTTACCAG
TTTGGTGGGC CAAGCCATGT TGATACCTTC GACATGAAGC CACTGGCACC GGATGGCACC
CGCAGCCAGT TTGAAACGAT CTCGACATCC GTCCCAGGAC TGTCAATCTG CGAGCACCTG
CCCCGAATGG CAGAGGTCAT GAATCGCGTT ACATTGCTCC GCACAGTGTG GCACACCATG
AAGAACCACA ACAGTGCCTC CTACTATGCA CTCACCGGCC ATCCACCCGC TGTCGATGAT
ATCCGCCTGC GCGACACGCT CGATCTCTTC CCGGCTTATG GTTCTGTGGT GGATCGATAT
GCACCCAATA CCAATGGCAT GCCGACATTT GTGGCTTACC CACACGTCAT TCGCGATGGC
GAAGTGACCC CCGGCCAGCA CGCGAGCTTT CTGGGGAAAG TGCACGATCC TCTTCTCGTC
ACCGCTGACC CAAATGCCCC AGGCTTCGGC TTGCCGGAAC TCAGCCTGCC AGCCGGCGTT
TCGACGGCAC GGCTCGAAAA TCGTCGGCAA CTGCAGCAGA TGATCAACGC TCAGGCCAAA
CTCGGCGATG CAGCAGTCGC TGCACGCGGC CTCGAAGATT ATTACTCCCG TGCTGTATCA
ATGCTGAACT CGCCAAAAAT CCGCCAGGCA TTTGCGATTG ATGAAGAGTC AGCAAGCGTT
AGAGATCGCT ATGGTCGCAC GGAGTATGGC CAAGGCTGTC TCCTGGCACG CCGTCTCGTC
GAGCGCGGCG TCAAGTTTGT CAGTGTCTAC TACTCGAAGA GTATTGGTGG CCGACGTAAA
GAAGAGGGCT GGGATACCCA CGGATTTGAT AACACCCGCA TGTATCCCAT TCTCAAAGAT
TATCACCTCC CCTTACTGGA TCAGACATTA CCGACATTGA TTCTCGATCT GGAAGAACGC
GGCCTGCTCG ACCAGACGCT CATTGTCTGG ATGGGCGAAT TTGGTCGCAC GCCCCGGCTC
AATGCCAATA TCAGCCGGGA TCACTGGCCG CAGTGCTATA GTGTGCTGCT GGCGGGTGGA
GGGACGAAAA AAGGCTACGT CCATGGCACA TCCGATAAGA CCGGTGCTTT CCCTGAGAAA
GATCCTGTGG CTTTGGACGA TCTCGCAGCG ACGATGTTCT CTGCGATCGG AGTTCCACCA
GAGACAGAAC TTCGAGATCG CGGCAATCGA CCACTCGCTG CAGCGCTCGG TCACGTTGTC
TCCGAAATCT TTGCTTAA
 
Protein sequence
MNHHLQKLLS QKLPRRQLLT AGGMAGFGLT LPRWLAAQDQ AAAELPAAMS TAKSVIFLYQ 
FGGPSHVDTF DMKPLAPDGT RSQFETISTS VPGLSICEHL PRMAEVMNRV TLLRTVWHTM
KNHNSASYYA LTGHPPAVDD IRLRDTLDLF PAYGSVVDRY APNTNGMPTF VAYPHVIRDG
EVTPGQHASF LGKVHDPLLV TADPNAPGFG LPELSLPAGV STARLENRRQ LQQMINAQAK
LGDAAVAARG LEDYYSRAVS MLNSPKIRQA FAIDEESASV RDRYGRTEYG QGCLLARRLV
ERGVKFVSVY YSKSIGGRRK EEGWDTHGFD NTRMYPILKD YHLPLLDQTL PTLILDLEER
GLLDQTLIVW MGEFGRTPRL NANISRDHWP QCYSVLLAGG GTKKGYVHGT SDKTGAFPEK
DPVALDDLAA TMFSAIGVPP ETELRDRGNR PLAAALGHVV SEIFA
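
The gene and protein sequences are printed in space-separated blocks of ten characters. As a minimal sketch, the code below strips that whitespace, computes the GC fraction, and translates a coding sequence. It uses the codon-to-amino-acid assignments of NCBI translation table 11, which for elongation codons are the same as the standard code. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAATCATC ATCTACAGAA ATTGTTGAGT"
print(translate(first_blocks))  # prints: MNHHLQKLLS
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the listed protein sequence and the GC content field.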