Gene Acid345_4298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4298 
Symbol 
ID4071871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5107490 
End bp5108935 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content58% 
IMG OID637986331 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_593372 
Protein GI94971324 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.15389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTACTC GTCTTCCAGT CATCATTCAA GGCGGTATGG GCGCTGGCGT TTCAGGTTGG 
CGCCTGGCGC GCGCAGTTTC TTCCAAAGGA CAACTTGGCG TGGTTTCCGG CACAGCCCTG
GATGTGCTCA TGGCACGCCG TCTGCAGGAC GGTGATCAGG GCGGACATGT CCGCTTCGCT
CTGAAGCATT TTCCGTCGCA GGAGATCGCG GCGCGAATCA TTGACCGACA CTTCATCGAA
GGCGGGAAAG CTCCTGATGC GCCATACAAG CCTGTCCCAA TGCATTCGCC GCGACCGAAC
AGTGAGCTCA TTCAGCTGAC GATCGCAGCG AACTTTGTAG AAATCTTCCT CGCGCGACAC
GGACATAAAA ACCCTGTTGG CATCAACTAC CTCGAAAAGC TGCAACTGCC GCACCTTCCT
TCCATTTACG GAGCGATGCT CGCCGGCGTT GAGTACATTC TGATGGGCGC GGGAATCCCG
CTGAAAATCC CCGGCGTGCT CGACGCCTTC GTGAATCATG GAGCAGCGGA GTATCCGCTG
GCCGTCGCAG GTGCACAGGA CGGTGACGAT TTCAAAGCGA CGTTTGATCC TCGCGAGTAT
CTCGACATCG ACTTGCCGCC ACTCACGCGA CCGAAGTTCC TGCCAATTAT TGCTTCCAAC
GTGCTCGCTC TCACGTTGAT GAAGAAGGCG AACGGGCGTG TGGATGGATT CATCGTTGAA
GGTCCCACTG CAGGCGGGCA TAACGCGCCT CCGCGCGGGA AAATGCAATT TGACGCCGAT
GGCGAAGTCA TCTACGGCGA ACGCGATGCT GTTGATCTCG AGAAACTCCG CGCTCTTGGC
CTGCCGTTCT GGCTAGCGGG CGGATACGGT TCGCCAGAGA AGCTGGAAGA AGCATTAGAG
AACGGAGCCA CGGGCGTCCA GGTCGGCACT CCGTTTGCAT TGACCGACGA ATCGGACTTG
GAACCTGTGA GCAAGCGCGA ACTTCTCCGC CAGGTTAGGG AGGGTACCGC GCAGGTGCGC
ACGGACGTCA CCGCTTCGCC TACGGGCTTT CCCTTCAAAG TTGCGCAACT GTCGAATTCA
CAAACCAACA CGGCAATTTA CGAAGCGCGC GAACGCATCT GTGACCTCGG CTTTCTACGC
GAAGGCTATC GCACGCTCAC AGGAACGCTG GACTATCGCT GTGCCGCGGA ACCGGAATAC
GTGTTCCTTG CAAAAGGTGG CGACATAGAA CAGACGAAGG GCCGTAAGTG CCTTTGCAAC
GCGTTGGTAG CGAACATCGG AAAACCACAA CTGCGTGCGG AAGGCCGTCT TGAGCCAACC
CTGATCACCA TGGGCAACGA CCTCGTAGGA ATCGGCAGAT TCCTGCGTCC AGACCACGAA
GGCTATTCGG CGGCTGACGT GATTCAGCAC TTACTAACGG GCGTAGCGGC AGAGGCTATC
GCTTAG
 
Protein sequence
MGTRLPVIIQ GGMGAGVSGW RLARAVSSKG QLGVVSGTAL DVLMARRLQD GDQGGHVRFA 
LKHFPSQEIA ARIIDRHFIE GGKAPDAPYK PVPMHSPRPN SELIQLTIAA NFVEIFLARH
GHKNPVGINY LEKLQLPHLP SIYGAMLAGV EYILMGAGIP LKIPGVLDAF VNHGAAEYPL
AVAGAQDGDD FKATFDPREY LDIDLPPLTR PKFLPIIASN VLALTLMKKA NGRVDGFIVE
GPTAGGHNAP PRGKMQFDAD GEVIYGERDA VDLEKLRALG LPFWLAGGYG SPEKLEEALE
NGATGVQVGT PFALTDESDL EPVSKRELLR QVREGTAQVR TDVTASPTGF PFKVAQLSNS
QTNTAIYEAR ERICDLGFLR EGYRTLTGTL DYRCAAEPEY VFLAKGGDIE QTKGRKCLCN
ALVANIGKPQ LRAEGRLEPT LITMGNDLVG IGRFLRPDHE GYSAADVIQH LLTGVAAEAI
A