Gene Plim_0037 details


Gene Information

Locus tag: Plim_0037
Symbol:
ID: 9136690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 49565
End bp: 50875
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: oxidoreductase domain protein
Protein accession: YP_003628089
Protein GI: 296120311
COG category:
COG ID:
TIGRFAM ID:
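
The length fields in the table above follow from the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed 1311 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 49565
END_BP = 50875


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both results agree with the Gene Length and Protein Length entries of the record.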


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCAAT TGAATGGATC ACGTCGGGAG TTCCTCGCGA CGACCGGTAT GGTGGCAGGG 
GCGGCCCTGG CTGCACAGGC GGGCTTGGCC ACAAGTGCCT ATGCTCAAGG TGATGACGTG
ATCCGCATTG GCCTGGTGGG CTCTGGTGGA CGCGGAACAG GTGCAGCACG CGATGCTCTC
TCGACCGATC ACAACGTCAA GCTGGTCGCT GTGGGCGATG TGTTCGAAGA TCGCGCCAAT
CGGGCCGTGG GAAGTATCAA GGGCGGACTG GGCGACAAGG GAAGCAAAGT CGATGTTGCT
CCCGACAAGG TCTTCTTCGG CTTCGATGCC TTCAAGAAAG TGATTGATAG CGGTGTCGAT
CTCGTGATTC TGGCAACTCC TCCAGGCTTC CGCCCTCAGC AGTTCGAATA CGCCATTAAT
GCCGGCAAGC ATGTTTTCAT GGAAAAACCA GTCGCGACCG ACGCTCCCGG CGTCCGTCAG
GTGCTGGCCG CCAGCAAGCT GGCCAAAGAA AAGAACCTTA AGGTGGGCTG TGGTCTCCAG
AGACATCATG ACATTGGCTA TATCGAAACC ATCAACCGCA TCAAGGATGG CCAGATTGGC
GATGTGATTG CCATGCGGGC CTACTGGAAC AATGCCGGCG TGTGGGAACC ACCGCTCAAG
CGCGAAAACG CCAAGAGCGA AATGGAATAC CAGATGCGTA ACTGGTACTA CTACAACTGG
CTGTGCGGTG ACCACATCGT CGAGCAGCAC ATTCACAATC TCGACGTCTG TAACTGGGTG
AAGGGTGATT ACCCAGTCCG TGCCAATGGG ATGGGTGGCC GACAGGTTCG TGTCGATAAG
CGATATGGCG AAATCTACGA CCATCACTGC GTCGAATTCG AATACAAGGA TGGTTCCCGC
ACTTATTCCC AGTGCCGCCA TATTCCGAAC ACCTGGAATC AGGTCTCTGA ATTTGTGCAT
GGCTCGAAGG GAACTTCAAA TCCTTCCGGT TCGATTACAG CCGCCGGGAA CGATTGGCGT
TATCGCGGGC CGCGCCCCAA TCCTTACGTG GTCGAGCACG ACGACTTGCT CAAGGCCATT
AAGGGTGGTG TTGCCTACAA CGAAGCCGAT AACGGCGCTT TCAGCACGAT GACATCGATC
CTGGGCCGCA TGGCGACATA CTCAGGGAAA GTCATTGAGT GGAACGATGC CCTCAACTCA
CAGATCAGTC TCTTCCCCAA GGAACTGAGC TGGGATGCCG ACATGCCTGT GAAGCCCGAT
AGCGAAGGCA ATTATCCCAT TGCCACGCCA GGTGTGACCA AAGTGGTCTA A
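
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks need to be removed before it is used. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names are hypothetical, and only the first displayed line is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed (six blocks of ten bases).
first_line = "ATGGATCAAT TGAATGGATC ACGTCGGGAG TTCCTCGCGA CGACCGGTAT GGTGGCAGGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to clean_sequence and then to gc_percent gives a value that can be compared with the GC content listed in the record.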
 
Protein sequence
MDQLNGSRRE FLATTGMVAG AALAAQAGLA TSAYAQGDDV IRIGLVGSGG RGTGAARDAL 
STDHNVKLVA VGDVFEDRAN RAVGSIKGGL GDKGSKVDVA PDKVFFGFDA FKKVIDSGVD
LVILATPPGF RPQQFEYAIN AGKHVFMEKP VATDAPGVRQ VLAASKLAKE KNLKVGCGLQ
RHHDIGYIET INRIKDGQIG DVIAMRAYWN NAGVWEPPLK RENAKSEMEY QMRNWYYYNW
LCGDHIVEQH IHNLDVCNWV KGDYPVRANG MGGRQVRVDK RYGEIYDHHC VEFEYKDGSR
TYSQCRHIPN TWNQVSEFVH GSKGTSNPSG SITAAGNDWR YRGPRPNPYV VEHDDLLKAI
KGGVAYNEAD NGAFSTMTSI LGRMATYSGK VIEWNDALNS QISLFPKELS WDADMPVKPD
SEGNYPIATP GVTKVV
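
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a minimal sketch, the code below builds a codon table from the standard codon ordering and translates the first 30 bases of the gene. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, and this gene starts with ATG, so no special start handling is included. The function is hypothetical and simplified: it does not handle ambiguous bases.

```python
from itertools import product

# Codons in TCAG order paired with the amino acids of the standard code,
# which table 11 shares for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGATCAAT TGAATGGATC ACGTCGGGAG"))  # MDQLNGSRRE
```

The output matches the first ten residues of the protein sequence above. The last four codons of the gene, AAA GTG GTC TAA, translate to KVV followed by a stop, which matches the end of the protein.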