Gene Shewana3_0474 details

Gene Information

Locus tag: Shewana3_0474
Symbol:
ID: 4476703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 550973
End bp: 552034
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 56%
IMG OID: 639725008
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_868121
Protein GI: 117918929
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
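
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not say so explicitly) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 550973
end_bp = 552034
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed
# to be the stop codon, which contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1062
print(remainder)                # 0
print(expected_protein_length)  # 353
```

Under these assumptions the computed values agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields.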


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTGCC GACTAACACA ATTGTTTGGG ATCCAGTTTC CGATTATTCA AGCGCCGATG 
GCGGGCGTGC AGGGCAGTGC ACTGGCGATT GAAGTATCGC AGGCGGGCGG ATTGGGCTCC
TTGCCCTGCG CTATGTTATC CCTCGAGGCA CTAGAAGCTG AGTTAACCGA AATCCGAGCC
AACACCACTA AATCTATCAA TGTGAATTTC TTTTGCCATA GCGAGCCTTT ACCGCAAGCG
GCCAAGCAAG CGGCTTGGCT CGAACAGTTA GCACCCTATT TTGCGGAATT TAATCTCGAC
CCAAATGCGC AGCCTGCTGG CGCCCAGCGC ACACCCTACA GCAAGGCGCA GGCTGAGGTG
TTAGCCAAAT TTAAGCCCGA GGTGGTGAGT TTTCATTTTG GGCTGCCCGA TGAAGAATTG
CTGCTGGAAA TCAAATCCTG GGGCTCAAAA GTTATCTCCG CGGCGACCAC AGTCGAGGAG
GCGCTCTGGC TCGAGGCTCG CGGCGCCGAT GCGATTATTG CCCAAGGCTT GGAGGCGGGT
GGGCATAGAG GGCACTTTTT ATCCGAGGAT TTAACCGAGC AGCAGGGGAC CTTTAGTTTA
TTACCGCAGG TGATTGCGGC TGTCGATATT CCGGTGATTG CCGCAGGTGG CATTGTCGAT
GCCACCACAG TGCGCGCTGC CATGGCAATG GGGGCTTCGG CGGTGCAGGT GGGGACAGCT
TATTTACTCT GCCCCGAGTG CAACACCAGC AGCATTCATC GTGAGGCGCT GCAAAGCGAC
GCTGCGCAGC ATACGGCACT GACGAACTTA TTTTCTGGTC GACCCGCCCG CGGCATAGTG
AACCGCTTTA TGGCCGAAAT GGGGCCGATA AATGAGGCCG TGCCAGATTT CCCCTTGGCA
TCCTCGGCGG TTGCAGGCTT AAGAACGGCA GCGGAGAGGC AAGGATTTGG AGATTTTAGC
CCGCTATGGT GCGGACAAAA TGCCAGTGGT TGCCAAAACA TTCCCGCAGC CGAGTTGACG
CGGCAGTTAG CTTTAGGCTT GATGGGGTCA TTATCTGGCT GA
 
Protein sequence
MPCRLTQLFG IQFPIIQAPM AGVQGSALAI EVSQAGGLGS LPCAMLSLEA LEAELTEIRA 
NTTKSINVNF FCHSEPLPQA AKQAAWLEQL APYFAEFNLD PNAQPAGAQR TPYSKAQAEV
LAKFKPEVVS FHFGLPDEEL LLEIKSWGSK VISAATTVEE ALWLEARGAD AIIAQGLEAG
GHRGHFLSED LTEQQGTFSL LPQVIAAVDI PVIAAGGIVD ATTVRAAMAM GASAVQVGTA
YLLCPECNTS SIHREALQSD AAQHTALTNL FSGRPARGIV NRFMAEMGPI NEAVPDFPLA
SSAVAGLRTA AERQGFGDFS PLWCGQNASG CQNIPAAELT RQLALGLMGS LSG
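
The relationship between the two sequences above can be reproduced with a short script. This is a minimal sketch, not the tool used to produce this record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The sequences are written in space-separated blocks of ten, so whitespace is stripped before processing.

```python
# Build the codon table in the conventional TCAG order.
# Assumption: amino-acid assignments of translation table 11 are the
# same as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence above (30 bases).
first_blocks = "ATGCCGTGCC GACTAACACA ATTGTTTGGG"
print(translate(first_blocks))  # MPCRLTQLFG
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.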