Gene Cpin_4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4574 
Symbol 
ID8360747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5718407 
End bp5719489 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content51% 
IMG OID644966729 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_003124217 
Protein GI256423564 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGTATC AGACAAAAGC ATCGGAATTA TTAGGCATTA CGTATCCTGT TTTACAGGGG 
CCTTTTGGCG GTAATCTGTC CTCGGTGGAA TTGGTGGCTA CAGTGTCAAA TGCAGGCGGA
TTAGGCGGCT TTGGAGCCTA TTCTATGAGT CCGCAGGAGA TCGCAGCCCT GGATGAGCAG
ATCAGAAACG CCACTGATAA ACCGTATAAT ATCAATCTGT GGGTGAATGA TACAGATGCG
GTTGATGGCA CGGTTACAGA CGAACAGTTT AGACACACGC AGGACGTCTT CCGGCCTTAT
TTCGATCAAC TGGGTATTGC ATTACCTGAG AAGCCAGGAC CATTCCGCTC ACGGTTTGAA
GACCAGGTAG CCGTGATCCT GGACCGGAAA CCACCGGTAT TCAGCTTTAT GTTTGGTATT
CCATCCGCAG ATATACTGGA ACAATGTCGT AAGGCCGGTA TTGTCACAAT CGGCGCCGCT
ACCACACCGG ATGAAGCGAT CGCGTTGGAA GCAGCAGGCG TAGATCTGAT CGTGGCTTCC
GGGTTTGAGT CTGGTGGACA CCGTCCTTCC TTCCTCCAAT CTGCAGAAGC TTCTACAACC
GGTACTTTCG TCCTGGTGCA ACTGATCAGA GAAAAGGTAA AGACCCCGGT GATCGCGGCA
GGAGGCATTG CCAATGGAAA GGGAATTGCT GCGGCACTGA CACTGGGAGC AGATGCGGTG
CAGATAGGTA CGGCCTTCCT GGCCTGTGAG GAATCAAATG CGACGCCTTT ACACAGGGAA
ATGCTGTTCT CTGATGCAGC CCGACAAACC ACATTGACCC GCGCCTTTAC CGGTAGGTTG
GGCCGGGGAA TCGCCAACAG GATTACTTCA ACGCTGGCGC CCGATACAAA GAACTTTCTG
CCCTTTCCTT TACAGACTAC CTTCCTGTCA TCTCTACGTA AAGCGGCGCT TGAAAAGGAA
CAGTGGGACA TGATCTATTT CTGGGGTGGT CAGATTGCCC CGTTATTAAA ACATAGAAAA
GCCGCCGGAT TGATGCAATC GCTGCTGGAA GAAACAACAG CGTATTTCGG AGGTAGAAAA
TAA
 
Protein sequence
MWYQTKASEL LGITYPVLQG PFGGNLSSVE LVATVSNAGG LGGFGAYSMS PQEIAALDEQ 
IRNATDKPYN INLWVNDTDA VDGTVTDEQF RHTQDVFRPY FDQLGIALPE KPGPFRSRFE
DQVAVILDRK PPVFSFMFGI PSADILEQCR KAGIVTIGAA TTPDEAIALE AAGVDLIVAS
GFESGGHRPS FLQSAEASTT GTFVLVQLIR EKVKTPVIAA GGIANGKGIA AALTLGADAV
QIGTAFLACE ESNATPLHRE MLFSDAARQT TLTRAFTGRL GRGIANRITS TLAPDTKNFL
PFPLQTTFLS SLRKAALEKE QWDMIYFWGG QIAPLLKHRK AAGLMQSLLE ETTAYFGGRK