Gene Sbal195_4010 details

Gene Information

Locus tag: Sbal195_4010
Symbol:
ID: 5755829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 4720021
End bp: 4721070
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 54%
IMG OID: 641290356
Product: 2-nitropropane dioxygenase NPD
Protein accession: YP_001556430
Protein GI: 160877114
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
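
The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based and inclusive).
start_bp = 4720021
end_bp = 4721070

# Inclusive coordinates: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon
# and therefore contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```

Under these assumptions both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.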


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGTGCC AGTTAACCCG ATTATTTGGA ATCGAATTGC CAATTATTCA AGCGCCTATG 
GCGGGTGTTC AGGGCAGTGC CTTAACGATA GCCGTGTCGC AAGCGGGTGC GTTGGGATCT
TTGCCCTGCG CCATGTTATC CCTTGAAGCC TTAGATGCCG CGTTGATACA GGTCCGCGCC
CAGACGACTA AGCCCATCAA TGTAAACTTC TTTTGTCATC ACGAGCCCGA ACCGCAAGCG
GCGAAACAAG CTGCATGGCT CAAGCAACTC GCGCCTTATT TTGCTGAGCT GGGTATCCAT
TCTGATACCC ACGCTGCGGG AGCACAACGC GCGCCCTACA CGAGTGCTCA GGCTGAAGTG
TTAGCAAAAT TCAAGCCTGA GGTTGTGAGT TTTCACTTTG GTTTACCCAA TGAAGACTTG
TTGCTGGAGA TAAAATCTTG GGGATCTAAG ATCATCTCGA CGGCGACCAC AGTGGAAGAG
GCGCTCTGGT TAGAAGCGCG TGGTGTAGAT GCTATTATTG CGCAGGGCTT AGAAGCTGGC
GGCCATAGAG GACATTTCTT ATCCGATGAT TTGACTGAGC AGCTCGGCAC TTTTAGTTTA
TTGCCGCAAA TCATTGCTGC CGTTGATGTG CCCGTTATCG CAGCGGGTGG CATAGTCGAT
GCAAAAACCG TGCGTGCCGC CATGGCGATG GGCGCTTCGG CGGTGCAAGT TGGGACGGCC
TATTTACTTT GCCCTGAATG CACCACCAGT GAGATCCATC GCGCCGCACT GCAAAGCGAG
GCGGTGCAGC ATACGGCTCT GACTAACTTA TTTTCTGGTC GCCCCGCACG CGGCATAGTG
AATCGTTTTA TGGCTGAACT TGGACCTATC AATAACGCAG TACCGGATTT TCCGCTCGCC
TCCAGTGCCG TCGCAGTATT GCGAAGCGCC GCCGAGCAAC AAGGGTTCGG TGATTTTAGC
CCGCTCTGGT GCGGCCAAAA TGCAGCCGGC TGTCAGGCGA TTCCCGCCGC AGAGCTCACC
AAGCAATTGG CGTTAGGCTT GATAGTTTAA
 
Protein sequence
MPCQLTRLFG IELPIIQAPM AGVQGSALTI AVSQAGALGS LPCAMLSLEA LDAALIQVRA 
QTTKPINVNF FCHHEPEPQA AKQAAWLKQL APYFAELGIH SDTHAAGAQR APYTSAQAEV
LAKFKPEVVS FHFGLPNEDL LLEIKSWGSK IISTATTVEE ALWLEARGVD AIIAQGLEAG
GHRGHFLSDD LTEQLGTFSL LPQIIAAVDV PVIAAGGIVD AKTVRAAMAM GASAVQVGTA
YLLCPECTTS EIHRAALQSE AVQHTALTNL FSGRPARGIV NRFMAELGPI NNAVPDFPLA
SSAVAVLRSA AEQQGFGDFS PLWCGQNAAG CQAIPAAELT KQLALGLIV
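
The relationship between the two sequences can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced the record: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 permits (not needed here, since the gene starts with ATG). The names `translate`, `first_line`, and `last_line` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino-acid
# string is the standard assignment, which table 11 shares for
# elongation codons ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above. Each full line holds
# 60 bases, so every line starts in frame.
first_line = "ATGCCGTGCC AGTTAACCCG ATTATTTGGA ATCGAATTGC CAATTATTCA AGCGCCTATG"
last_line = "AAGCAATTGG CGTTAGGCTT GATAGTTTAA"

print(translate(first_line))  # MPCQLTRLFGIELPIIQAPM
print(translate(last_line))   # KQLALGLIV
```

The two outputs match the first 20 and last 9 residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the whole protein, provided the sequence is copied without errors.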