Gene Bind_2091 details

Gene Information

Locus tag: Bind_2091
Symbol:
ID: 6200518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2390353
End bp: 2391441
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 61%
IMG OID: 641706077
Product: 2-nitropropane dioxygenase NPD
Protein accession: YP_001833201
Protein GI: 182679055
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
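
The start, end, gene length and protein length fields above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2390353, 2391441)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Both results match the Gene Length and Protein Length fields of the record.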


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAT GGCCCGACAC ACGAATCCTC GATCTCTTCG GCATTGAGCT CCCCATCATC 
CAGGCACCGA TGGCCGGGGC CACGACTCCC GAGATGGTCA TCGCGGTCAG CGAGGCTGGA
GGGCTCGGCT CATTGCCCAG CGCTCAATAT ACGGTCGAGC AGCTTCGCGC GGCTTTGGAA
AGCATACGTT CTGGTACATC CCGGCCGATC AATGTCAATT TCTTCTGCCA CGTCATGCCC
GCTGACGATC CCGCGCGGCA AATGGCCTGG CGGGCCAGGC TCGCGCCCTA TTACGTGGAG
GCTGGGCTCG ATCCAGCCGT CCCCCCGCCC GTTACGGGCC GCGCCCCCTT TGATGACGCT
TTCTGTCAGG TCGTCGAGGA ATTTCGGCCG GAAGTCGTCA GTTTCCATTT TGGCCTGCCG
GAAACTTCTT TGTTGGAACG GGTGAAAAAG ATCGGCGCCA AGGTCATATC CTCTGCCACG
ACTGTCGCCG AGGCCCTCTG GCTCAATGAG CGTGGCATTG ATGCCATTAT CGCCATGGGT
TTCGAAGCCG GCGGTCATCG CGGCAATTTT CTGACCCAGG ATATGGCGAC CCAAGTCGGG
ACCATCGCGC TCGTGCCACA AATCGTCGAT GCCGTTCGCG TCCCCGTCAT CGCGGCCGGC
GGCATCGCCG ATGGGCGCGG CATAGCAGCC GCTTTCATGC TCGGGGCTTC GGCTGTGCAG
ATCGGCACCT CCTATCTTTT TACACAGGAA GCGAAAATCC CCGCGATCCA CAGGGAAGCC
TTGGAATCCG CCACCGACGA CAATACGGCG CTGACCAATC TCTTCACCGG TCGGCCTGCG
CGCGGCATTC TCAACCGAAT CATGCGCGAG ATCGGCCCGC TTTCGGATCT TGCTCCCGCT
TTTCCAACCG CAGGCGGGGC ACTCGCCCCC TTAAAGAGCC ATGCGGAAGC CAAGCAATCG
GGGGATTTCA CCAACCTTTG GTCAGGTCAG GCCGCGCGCC TGGCGCATAA GCTCCTGCCT
GCCGGAACTT TGACCCGACA ATTGGCGGAT GAGGCAGAAA AGCGCCTGTA CCCTGGCGGT
GCACATTGA
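
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch, the GC content field could be recomputed along the following lines. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example uses only the first row of the sequence rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence only; paste all rows to check the full gene.
first_row = "ATGACAACAT GGCCCGACAC ACGAATCCTC GATCTCTTCG GCATTGAGCT CCCCATCATC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```

Running the same two functions over all rows of the gene sequence gives the figure to compare with the GC content field.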
 
Protein sequence
MTTWPDTRIL DLFGIELPII QAPMAGATTP EMVIAVSEAG GLGSLPSAQY TVEQLRAALE 
SIRSGTSRPI NVNFFCHVMP ADDPARQMAW RARLAPYYVE AGLDPAVPPP VTGRAPFDDA
FCQVVEEFRP EVVSFHFGLP ETSLLERVKK IGAKVISSAT TVAEALWLNE RGIDAIIAMG
FEAGGHRGNF LTQDMATQVG TIALVPQIVD AVRVPVIAAG GIADGRGIAA AFMLGASAVQ
IGTSYLFTQE AKIPAIHREA LESATDDNTA LTNLFTGRPA RGILNRIMRE IGPLSDLAPA
FPTAGGALAP LKSHAEAKQS GDFTNLWSGQ AARLAHKLLP AGTLTRQLAD EAEKRLYPGG
AH
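
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table by hand instead of relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the name `translate` is illustrative.

```python
from itertools import product

# Amino acids in NCBI order: first, second and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGACAACAT GGCCCGACAC ACGAATCCTC GATCTCTTCG GCATTGAGCT CCCCATCATC"
print(translate(first_row))  # MTTWPDTRILDLFGIELPII
```

The output matches the first twenty residues of the protein sequence, and the final nine bases of the gene (GCACATTGA) translate to the closing residues AH followed by a stop codon.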