Gene Rru_A3787 details


Gene Information

Locus tag: Rru_A3787
Symbol:
ID: 3837244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 4337976
End bp: 4339373
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 66%
IMG OID: 637827912
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_428868
Protein GI: 83595116
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
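
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4337976
end_bp = 4339373

# Assumption: coordinates are inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1398 bp
print(protein_length, "aa")  # 465 aa
```

Both results agree with the Gene Length and Protein Length fields listed above.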


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGATA TCAAACCGCT CCTGATATCG GGCAAGGAAG TGTATCCCCT GGTCGAAGGC 
GGCAAGGGCG TTGCGGTGAC GAACGGCCGC AGCTCGGGCG CCTGGGCGGC CGCTGGTGGT
GTTGGCACCA TCAGCGCGGT GAATGCCGAT TTTTATGACG AGACCGGCGC CCTGGCCAAT
CAGGTTTACA AGGGGCGGAC GCGCAGCGAA CGCCACCGCG AGCTGATCGA CTTCGGCATC
AAGGGCGGCA TCGTCCAGGC CCGCATCGCC CACGAGGAAG CGCGCGGCGA AGGGCGGATG
CACATCAACG TCCTGTGGGA AATGGGCGGG GCCCGCGAGG TTCTCGAAGG CGTGCTCGAA
GGCGCCAAGG GCCTGGTCCA TGGCGTGACC TGCGGCGCCG GCATGCCCTA TGGCGTGGCC
GAGATCGCCG CCCGCCACCG CGTCTATTAC TATCCCATCG TGTCGTCGGC CCGCGCCTTC
CGCGCCCTGT GGAAGCGGGC CTATAGCAAG GCCCCCGAAT GGCTGGGCGG CGTCGTCTAT
GAGGATCCCT GGCTGGCCGG CGGCCACAAT GGCCTGTCCA ACAGCGAAAA CCCGCGCGAA
CCCCAGCCGC CCCTGCCGCG CGTCGCCGAA TTGCGCGCCC AGATGCGCGC CGTCGGCGCT
CCCGAGGTGC CGATCATCAT GGCCGGCGGC GTGTGGTATC TGCGCGAATG GGCCGAATGG
CTTGAAAACC CCGAGCTGGG GCCGATCGCC TTCCAGTTCG GCACCCGGCC GCTGCTGACC
CAGGAAAGCC CGATTTCCGA CGAGTGGAAG CAGCGCCTGC TGACCTTGCG TCCGGGCGAC
GTGTTGCTCC ATCGCTTCAG CCCGACGGGG TTCTACTCCT CGGCCGTGCG CAATGACTTC
CTTCAGGAAC TGGTCGAGCG CTCCAACCGC CAGATCACCT ATTTCACCGA GCCCCAGGGC
CAGCACACCA CCAGCTTCGC CGTCGGCCCG CGCGCCCGCG AGGTGTTCGT GCGCGCCGAG
GATAGCGTTC TGGCCCATGC CTGGGTCGCC CAGGGCTTCA CCGAAGCCAT GCGCACGCCC
GATAACTCGC TGATCTTCGT CACGCCCGAG CGCGCCGAGC GCATCAAGAC CGACCAGATC
AATTGCATGG GCTGCCTGTC GGCCTGCGGG TTCTCGAACT GGGCGGAGAA CGAGTTGAAC
AACACCGGCA AGCGCGCCGA CCCCCGCTCG TTCTGTATTC AAAAGACCCT CCAGGAAATC
GCCCACGGCC ACCCCGTCGA CCAGAACCTG ATGTTCGCCG GCCATAACGC CTTCCGCTTC
GCCACCGATC CGTTCTTCAC TTCGGGACGG ATTCCGACCA TGGGCGAGTT GGTCGAGCGC
ATCCTGACGG GCGATTGA
 
Protein sequence
MKDIKPLLIS GKEVYPLVEG GKGVAVTNGR SSGAWAAAGG VGTISAVNAD FYDETGALAN 
QVYKGRTRSE RHRELIDFGI KGGIVQARIA HEEARGEGRM HINVLWEMGG AREVLEGVLE
GAKGLVHGVT CGAGMPYGVA EIAARHRVYY YPIVSSARAF RALWKRAYSK APEWLGGVVY
EDPWLAGGHN GLSNSENPRE PQPPLPRVAE LRAQMRAVGA PEVPIIMAGG VWYLREWAEW
LENPELGPIA FQFGTRPLLT QESPISDEWK QRLLTLRPGD VLLHRFSPTG FYSSAVRNDF
LQELVERSNR QITYFTEPQG QHTTSFAVGP RAREVFVRAE DSVLAHAWVA QGFTEAMRTP
DNSLIFVTPE RAERIKTDQI NCMGCLSACG FSNWAENELN NTGKRADPRS FCIQKTLQEI
AHGHPVDQNL MFAGHNAFRF ATDPFFTSGR IPTMGELVER ILTGD
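
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon-to-amino-acid assignments (which translation table 11 shares) and treats an initial ATG, GTG, or TTG as a methionine start, which is a simplification of the full table 11 start codon set. The function name `translate_cds` is hypothetical and does not come from any library.

```python
# Compact codon table: bases in TCAG order, one amino acid letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon such as GTG is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAAAGATA TCAAACCGCT CCTGATATCG"))  # MKDIKPLLIS
```

The output matches the first ten residues of the protein sequence, with the GTG start codon read as M.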