Gene Information |
Locus tag | Ent638_3366 |
Symbol | |
ID | 5112350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3669455 |
End bp | 3670513 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640493572 |
Product | adenine DNA glycosylase |
Protein accession | YP_001178079 |
Protein GI | 146313005 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
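
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3669455
end_bp = 3670513

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (see "Is gene spliced | No"), so
# every 3 bases form one codon and the last codon is the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1059
print(protein_length)  # 352
```

Both results agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.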
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0209119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCATGC ATGCCTCTCA ATTTTCAGCC CAGGTGCTGG ACTGGTACGA CAAATACGGG CGTAAAACCC TGCCCTGGCA AATTGAAAAA ACACCGTACA AAGTATGGCT CTCTGAGGTG ATGTTGCAAC AAACGCAGGT CGCGACGGTT ATTCCTTATT TTGAGCGCTT TATGACGCGC TTCCCGACCA TCACCGATCT CGCCAATGCG CCCTTAGACG AAGTGCTGCA CCTGTGGACG GGCCTCGGCT ATTACGCCCG CGCGCGCAAT CTGCACAAAG CCGCGCAACT GGTGGCAACG ACGCATCAGG GTAAATTCCC CGAAACGTTT GAAGAAGTGG CTGCGCTGCC TGGCGTTGGG CGTTCCACCG CAGGCGCGGT GCTGTCCCTT TCCCTGGGCA AACATTTTCC GATTCTCGAC GGCAACGTGA AACGCGTGCT GGCCCGCTGT TATGCGGTGG ATGGCTGGCC GGGTAAAAAA GAGGTCGAAA AGCGTTTATG GGAAATCAGC GAAGCCGTCA CACCGGCAAA GGGCGTTGAG CGTTTTAACC AGGCGATGAT GGATTTGGGT GCGATCGTGT GCACGCGTTC AAAACCAAAA TGCGAGCTTT GCCCGGTCAA CAACCTCTGC ATGGCCTACG CGAATCATTC ATGGGCGCAA TATCCGGGTA AAAAGCCCAA GCAGACAATT CCGGAACGCA CCGGGTATAT GTTGCTGATG CAGCACGATG ATGAAGCGTA TCTCGCCCAG CGTCCGCCGA GCGGTTTGTG GGGCGGATTA TTCTGTTTCC CGCAGTTCGA ATCCGAAGAA GGGCTGCGTC AGTGGCTGGC AGATCGTGGA ATCAACGCCG ATAATCTCAC GCAACTGACT GCATTTCGCC ACACGTTTAG CCATTTCCAT TTAGATATTG TGCCAATGTG GCTTCCCGTG TCCTCATTCG CCTCATGCAT GGATGAAGGA ACGGGTCTCT GGTATAACTT AGCGCAACCG CCATCAGTCG GGCTGGCCGC TCCGGTGGAG CGCCTGTTAC AACAATTACG TGCCGGAGCA GTGGTTTAG
Protein sequence | MTMHASQFSA QVLDWYDKYG RKTLPWQIEK TPYKVWLSEV MLQQTQVATV IPYFERFMTR FPTITDLANA PLDEVLHLWT GLGYYARARN LHKAAQLVAT THQGKFPETF EEVAALPGVG RSTAGAVLSL SLGKHFPILD GNVKRVLARC YAVDGWPGKK EVEKRLWEIS EAVTPAKGVE RFNQAMMDLG AIVCTRSKPK CELCPVNNLC MAYANHSWAQ YPGKKPKQTI PERTGYMLLM QHDDEAYLAQ RPPSGLWGGL FCFPQFESEE GLRQWLADRG INADNLTQLT AFRHTFSHFH LDIVPMWLPV SSFASCMDEG TGLWYNLAQP PSVGLAAPVE RLLQQLRAGA VV
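
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, the codon table is written out by hand in the usual NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard table; it differs in permitted start codons, which does not matter here because the gene begins with ATG), and the input is the first 30 bases of the gene sequence as printed above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 amino-acid assignments equal the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces that split the record's sequence into blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a plus-strand coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACCATGC ATGCCTCTCA ATTTTCAGCC"
print(translate(first_blocks))  # MTMHASQFSA
```

The output matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, and applying `translate` to it should give the listed protein sequence.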