Gene Acid345_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3501 
Symbol 
ID4072760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4131033 
End bp4132103 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content62% 
IMG OID637985524 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_592576 
Protein GI94970528 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGAGCC GCCTGCTCGA CCTGCTCGAT GTGCAACATC CGATCTTTCT TGCGCCGCTC 
GGCGGCGGGC CGTCCACCCC CGAACTGGCT TCCACGATAG GCAATTCCGG CGGCCTGGGA
GCCTTGGCAG CTGCCTATCT CACGCCAGAC CAGCTCATTT ATGACGTACA GAGTGCCCGT
AAGCGGACGG ATGCGCCGCT GAACGTCAAT TTATTCGCCG GGGGCTACCA CGCTTCGACG
CAGGACGATC CCCGCCCCAT GTTAGGGCTT CTGGAAGCCT CGCACCGCGA AGTAGGCCTG
CCAGAGCCAA ACCTTCCTGT CGTTCCGCCA GACCCATTCG ACCAGCAATT TGAGGCTCTT
CTATCTGCCA AACCGCGTGT TTTCAGCTTT ACCTTCGGAA TCCCGCTGGC GTCAGCCATC
CAGCGCGCCC AAAAACGGGG AATCCTCGTG TTCGGTACGG CGACTACCGT TCGTGAGGGT
CAGCTTCTTG CCTCTGCCGG TGTGGATGCC ATCGTCGCCC AGGGTGCCGA GGCCGGAGGC
CAGCGCGGCA CCTTCGACGT CTCCTTCGAG GAAGGACTGG TCCCGCTTCG CGCATTGGTC
GCCGGACTGG CAAACGCCGT CGCGCTGCCG GTAATCGCTT CTGGGGGCAT TATGAACGGC
CGAGAAATCG CCGAGATGCT GCGCCTGGGC GCCAGCGCTG TGCAGCTTGG AACCGTGTTT
CTGTGTACTC CCGAGGCTGG CACCTCTGCG CCCTACCGTA AAGCCCTTCT TGATGCCGAA
GAGGACCGCA CCAGGATCAC GTATGCATTC ACCGGCCGCG GAGCGCGCGG GATCGAGAAC
GCCTTTATGC GGCAAATGGC TGCACATCGC GATGCGATCC TGCCATTTCC CATGCAGAAC
CTGCTCACGC GCGATCTTCG CAAAGCTGCG ACCCAGCAGG GCAAACCGGA ATACCTATCA
TTGTGGGCTG GAACCGGGGT AGCGCAGATT CGCGCTGAAC CCGCCGCGCA GATCATGCGC
CGCTTGGTGG ACGAGATGCA GGAAGCACTT GGTGGGCCCG GTAGGATTTG A
 
Protein sequence
MRSRLLDLLD VQHPIFLAPL GGGPSTPELA STIGNSGGLG ALAAAYLTPD QLIYDVQSAR 
KRTDAPLNVN LFAGGYHAST QDDPRPMLGL LEASHREVGL PEPNLPVVPP DPFDQQFEAL
LSAKPRVFSF TFGIPLASAI QRAQKRGILV FGTATTVREG QLLASAGVDA IVAQGAEAGG
QRGTFDVSFE EGLVPLRALV AGLANAVALP VIASGGIMNG REIAEMLRLG ASAVQLGTVF
LCTPEAGTSA PYRKALLDAE EDRTRITYAF TGRGARGIEN AFMRQMAAHR DAILPFPMQN
LLTRDLRKAA TQQGKPEYLS LWAGTGVAQI RAEPAAQIMR RLVDEMQEAL GGPGRI