Gene Cpin_0937 details


Gene Information

Locus tag: Cpin_0937
Symbol:
ID: 8357051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 1119936
End bp: 1121000
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 48%
IMG OID: 644963091
Product: A/G-specific adenine glycosylase
Protein accession: YP_003120636
Protein GI: 256419983
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
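
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon (neither convention is stated in the record itself). The function names are hypothetical and used only for illustration.

```python
# Values copied from the record above.
START_BP = 1119936
END_BP = 1121000


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the 1065 bp gene length and 354 aa protein length listed in the record.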


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGT TTTTTACGAA TGCACTCCTG GAATGGAACG ATAACGAGAA TACGCGTTCT 
ATGCCATGGA AAGGTGAGAA AGATCCCTAC CGGATATGGC TGTCTGAAAT TATACTCCAG
CAAACACGTG TGGAACAGGG ATGGGCCTAC TACGAGAAGT TTATCTTAAA CTATCCGACC
GTTCAGGAAC TGGCCGCTGC GCCGGAAGAA GCAGTGTTCC GCCTCTGGCA GGGACTGGGT
TATTATGCCC GCTGTAAAAA TATGCTGGCA GCCGCCAAAC AGATCGCATC TCAGTATCAC
GGTCATTTCC CGAATACATA CGAAACCATA CAATCGCTCA AAGGTGTAGG CCCCTATACA
TCCGCGGCTA TCGCCTCCTT TGCATTCAAC CTGCCACACG CCGTACTGGA TGGCAACGTA
TTCCGCGTGC TGTCCCGGTT CTTTGATATA GATACACCCA TCGATACCAC AGCTGGTAAA
AAGCAATTTA CCGATCTGGC ACAGGAACTG CTTCCGCACG GAAAATCAGC TTCGTATAAC
CAGTCCATTA TGGACTTTGG CGCCGTTGTA TGCAAACCAC AGCAACCCGC CTGTAAAAGC
TGTCCGCTTG CCGCAAAATG CAAAGGCTAC CAGCAGGGAC TCACCGCGCT GTTACCCGTA
AAATCCAAAA AGCTGGTCAT TAAAAAGCGT TACTTCTACT ACCTGGTACT ACAGCATAAA
GAGAATGTCT ATATCCGGAA ACGTACGGAA AATGATATCT GGCAGAACCT CCATGAGTTC
ATTCTTATAG AAACTCCTGG TCCCGAAGAT CCAGGCTCCT TACTATCCTC TGCAGCCTTC
AAAGCAGTCA TGAAAGATAT ACGTTACAAC ATGGATGGGG CTTCCGCTAC CTTTAAACAA
CAGCTTACAC ACCAGACAAT CCATAGCCAG TTTCTCCTGC TTTCAGTCAG CAAAAAGCCC
GAAATCCCCG GCTACACCGC CGTCCCAAGG GATCAACTGG ATCTTTACGC CTTTCCTAAA
ACCATCACCG ACTTTCTCAG AAACAGGGAA CTTACCCTCT TTTGA
 
Protein sequence
MKQFFTNALL EWNDNENTRS MPWKGEKDPY RIWLSEIILQ QTRVEQGWAY YEKFILNYPT 
VQELAAAPEE AVFRLWQGLG YYARCKNMLA AAKQIASQYH GHFPNTYETI QSLKGVGPYT
SAAIASFAFN LPHAVLDGNV FRVLSRFFDI DTPIDTTAGK KQFTDLAQEL LPHGKSASYN
QSIMDFGAVV CKPQQPACKS CPLAAKCKGY QQGLTALLPV KSKKLVIKKR YFYYLVLQHK
ENVYIRKRTE NDIWQNLHEF ILIETPGPED PGSLLSSAAF KAVMKDIRYN MDGASATFKQ
QLTHQTIHSQ FLLLSVSKKP EIPGYTAVPR DQLDLYAFPK TITDFLRNRE LTLF
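
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is one way to do this with the Python standard library only. It assumes the gene sequence is given 5' to 3' in the coding orientation and that it starts with ATG, so the alternative start codons permitted by translation table 11 need no special handling. For internal codons, table 11 assigns the same amino acids as the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# The amino-acid assignments are those of the standard code, which
# translation table 11 shares for non-start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAAGCAGT TTTTTACGAA TGCACTCCTG"))
```

Passing the full gene sequence to `translate` in place of the 30-base prefix should reproduce the listed protein sequence if the assumptions above hold.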