Gene Shewmr4_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0476 
Symbol 
ID4250900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp541423 
End bp542484 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content56% 
IMG OID638117035 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_732613 
Protein GI113968820 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTGCC GACTAACACA ATTGTTTGGG ATCCAGTTTC CGATTATTCA AGCCCCGATG 
GCGGGCGTGC AGGGTAGTGC ACTGGCGATT GAAGTATCGC AGGCAGGTGG ATTGGGCTCC
TTGCCCTGCG CCATGTTATC CCTCGAGGCG CTTGAGGCTG AGTTAACCGA AATCCGCAGC
AATACCACTA AACCTATCAA TGTAAATTTC TTTTGCCATA GTGAGCCTTT AGCGCAGGCG
GCCAAGCAAG CGGCTTGGCT TGAACAGCTC TCGCCTTATT TTACTGAATT TAATTTCGAT
CCGAATGCGC AGCCCGCTGG CGCCCAGCGC ACCCCCTACA GCAAGGCGCA GGCCGAGGTG
TTAGCCAAAT TTAAGCCCGA GGTGGTGAGT TTTCATTTTG GGCTGCCCGA TGAAGAATTG
CTGCTGGAAA TCAAATCCTG GGGCTCAAAA GTTATCTCCA CGGCGACTAC AGTCGAGGAG
GCTCTTTGGC TCGAGGCCCG TGGCGCCGAT GCGATTATTG CCCAAGGTCT AGAGGCGGGC
GGGCACAGAG GGCACTTTTT ATCCGAGGAT TTAACCGAGC AGCAGGGGAC TTTTAGTCTA
TTACCGCAGG TGATTGCGGC AGTGGATATT CCGGTGATTG CCGCCGGCGG CATTGTCGAT
GCCACTACAG TGCGCGCCGC CATGGCGATG GGCGCTTCGG CGGTGCAAGT GGGAACAGCC
TATTTACTCT GCCCCGAGTG CAATACCAGC AGCATCCATC GTGAGGCGCT GCAAAGCGAC
GCCGCGCAGC ATACGGCACT GACTAATTTA TTTTCCGGTC GACCCGCTCG CGGCATAGTG
AACCGCTTTA TGGCCGAGAT GGGGCCGATG AATGAAGCTG TGCCAGATTT CCCCTTGGCA
TCCTCGGCCG TTGCAGGCTT AAGAACGGCG GCGGAGCAAC AAGGATTTGG CGATTTTAGT
CCGCTATGGT GCGGCCAAAA TGCCAGTGGT TGCCGAAACA TTCCCGCAGC CGAGCTGACG
AGGCAGTTGG CTTTAGGTGT GATGGGAGCA TTATCTGGCT GA
 
Protein sequence
MPCRLTQLFG IQFPIIQAPM AGVQGSALAI EVSQAGGLGS LPCAMLSLEA LEAELTEIRS 
NTTKPINVNF FCHSEPLAQA AKQAAWLEQL SPYFTEFNFD PNAQPAGAQR TPYSKAQAEV
LAKFKPEVVS FHFGLPDEEL LLEIKSWGSK VISTATTVEE ALWLEARGAD AIIAQGLEAG
GHRGHFLSED LTEQQGTFSL LPQVIAAVDI PVIAAGGIVD ATTVRAAMAM GASAVQVGTA
YLLCPECNTS SIHREALQSD AAQHTALTNL FSGRPARGIV NRFMAEMGPM NEAVPDFPLA
SSAVAGLRTA AEQQGFGDFS PLWCGQNASG CRNIPAAELT RQLALGVMGA LSG