Gene Sare_1611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1611 
Symbol 
ID5703466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1842326 
End bp1843783 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content69% 
IMG OID641271120 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001536495 
Protein GI159037242 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0274193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACTTC CGATCATCAT TCAGGGCGGA ATGGGTGTCG GCGTCTCCAG CTGGCGGCTG 
GCGGCAGCGG TATCAGCCGC AGGCCAGCTC GGAGTGGTGT CCGGAGTGGC ACTGGACGCG
TCCCTGGCCC GCCGGCTCCA GCTCGGCGAC GAGGACGGCA CGCTACGGCA GGCACTGGCC
GCATTCCCCG TGCCGGAACT CGCCCAGCGA GTGCTGGACC GCTACTACGT GCCGGGCGGA
ATCCCCGCGG GAAGACCGTT CCGACCGGCA CCGCTGCTGA GTATGCGGCC ACGCCGACAT
GCCAACGAGC TTGCCGTGGT CGCCAACTTC GTCGAGGTAC ACCTGGCCAA GCAGGGCCAC
GACGGTGTCA TCGGAATCAA CTACCTGGAA AAGATCCAAC TTGCTACCCC CGCCGCGGTG
TACGGAGCGA TGTTGGCCGG TGTCGACTAC ATCCTGATGG GGGCCGGCCT GCCCAGTGAG
ATTCCGTCAC TGATCGACGC CCTGAGCCGC CACCAGCCGG TCCGGTTGCC CGTCACCGTC
GACGGGGGCC AATCGGGCGA AACCTATACG GTTGCCTTCG ACCCGCCCGA CCTGGCCGGT
GACCTACCGC CCCTCCCCCG GCCGCGGTTC CTGGCCATCG TCTCCGCCGC GTCCCTGGTC
AGCTACCTTG CGCGCAGTCC TCGTACGCGC CCCGACGGCT TCGTCCTCGA AGGGGCCACC
GCAGGTGGTC ACTCGGCGCG GCCACGGGGC AGGATGGTCC TCGACGACAA CGGCGAACCC
GTCTACGGTG AGCGCGACCG GCTCGACCTG GCCAAGGTAG CCGCATCCGG GGCGCCGTTC
TGGGTTGCCG GCGGACAGGC CGACCCACGA CGGTTGGCCA CAGCCCAAGC AGCCGGGGCC
ACCGGCATTC AGGTCGGTAC CGCATTCGCC CTGTGTCGCG AATCGGGAAT CAACCCCCGG
TTGCGGCACC AGGTGCTCCA GCAGGCAATC GGCGGGCAGC TCGCAGTCCG CAACGATCCG
GCCGCCTCCC CGACCGGCTT CCCGTTCAAG ATCGCCCAAC TGGACGGCAC CGCCGCCGAG
GAATCTGTGT ATCGTTCCCG GACCCGCCGG TGCGACCTGG GATACCTGCG CACCCCGTAC
CTGCGGCCGA CTGGCCGGAT CGGGTTCCGG TGCATGGCAG AGCCGGTTGA GGACTACATC
CGCAAGGGCG GCGCAGCCGA GGACACCACG GGGAGCCGTT GCCTGTGCAA CGGGCTGATG
GCCACGATCG GCCTGGGCCA ACGACGCGGT GGCGGCGAGG TTGAGCCACC GCTGGTCACC
CTCGGCCAGG ACATCCGCGT GCTGACCGAA CTGCACCAAC GCTTCGGCGA CGATTACACG
GCCAGGGACG TCCTGCGCTA CCTGACCGCC GTGGACGGCC ACCAGACCGA CGCCGGCGAA
CCGGCGGGTG CGGGGTGA
 
Protein sequence
MELPIIIQGG MGVGVSSWRL AAAVSAAGQL GVVSGVALDA SLARRLQLGD EDGTLRQALA 
AFPVPELAQR VLDRYYVPGG IPAGRPFRPA PLLSMRPRRH ANELAVVANF VEVHLAKQGH
DGVIGINYLE KIQLATPAAV YGAMLAGVDY ILMGAGLPSE IPSLIDALSR HQPVRLPVTV
DGGQSGETYT VAFDPPDLAG DLPPLPRPRF LAIVSAASLV SYLARSPRTR PDGFVLEGAT
AGGHSARPRG RMVLDDNGEP VYGERDRLDL AKVAASGAPF WVAGGQADPR RLATAQAAGA
TGIQVGTAFA LCRESGINPR LRHQVLQQAI GGQLAVRNDP AASPTGFPFK IAQLDGTAAE
ESVYRSRTRR CDLGYLRTPY LRPTGRIGFR CMAEPVEDYI RKGGAAEDTT GSRCLCNGLM
ATIGLGQRRG GGEVEPPLVT LGQDIRVLTE LHQRFGDDYT ARDVLRYLTA VDGHQTDAGE
PAGAG