Gene Information |
Locus tag | Ent638_0152 |
Symbol | |
ID | 5114035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 174987 |
End bp | 176165 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640490309 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001174893 |
Protein GI | 146309819 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators; [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
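The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes a three-base stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 174987
END_BP = 176165
STOP_CODON_BP = 3  # assumption: the CDS length includes one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_BP) // 3


cds_bp = gene_length(START_BP, END_BP)
print(cds_bp)                  # 1179, matching "Gene Length"
print(protein_length(cds_bp))  # 392, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.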
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAGA AGCGTCACCG CATTACGTTG TTATTCAATG CCAACAAAGC CTATGACCGT CAGGTCGTGG AAGGCGTGGG TGAATATTTG CAGGCGTCGC AATCCGAATG GGACATCTTC ATAGAAGAAG ATTTTCGTGC CAGCATCGAG ACGATTAAAG ACTGGTTAGG CGATGGGGTT ATCGCAGATT TTGATGACCC CGATATCCAA AAAATGCTTG TCGATGTCGA CGTCCCCATT GTGGGCGTCG GCGGTTCTTA CCACTCGCCC GAAAAATATC CCCCGGTACA TTACATCGCG ACCGATAACT ACGCGCTGGT CGAAAGCGCG TTTTTGCATT TAAAAGAAAA AGGTGTTCAC CGCTTTGCCT TTTATGGTTT GCCAGCCTCC AGCGGTAAGC GATGGGCGAT GGAGCGCGAA CACGCCTTTT GTCAGATTGT GGCCCAGGAA AAGTATCGTG GCGTCGTTTA TCAAGGGCTC GAGACGGCAC CTGAAAACTG GCAGCACGCG CAAAACCGTC TGGCCGACTG GCTGCAAACG CTGCCACCGC AAACCGGCAT TATTGCGGTC ACAGACGCGC GTGCGCGTCA TGTACTCCAG GTGTGCGACC ATTTGCACAT TCCGGTTCCT GAAAAACTGT GCGTCATTGG CATCGATAAC GAAGAGTTAA CCCGTTATTT GTCACGCGTC GCGCTGTCAT CAGTGGCGCA GGGAACGCGT CAGATGGGCT ATCAGGCCGC GAAATTACTT CACCGATTGC TGGATAACGA AACCCTGCCG CTGCAGCGTC TGCTGGTGCC ACCGGTCCGT GTGGTGGAAC GGCGCTCCAC CGATTACCGT TCTTTAAACG ATCCCGCGGT GATTCAGGCG ATGCACTACA TCCGTAACCA TGCCTGCAAG GGCATCAAAG TGGATCAGGT GTTGGATGCC GTGGGCATTT CTCGCTCGAA TCTGGAGAAG CGTTTTAAAG AAGAGGTCGG GGAAACGATT CATGCGGTCA TTCACGCCGA AAAACTGGAA AAAGCGCGCA GTCTGTTAGT GTCGACGTCA TTGTCGATCA ACGAGATTTC GCAGATGTGC GGCTACCCAT CGTTACAGTA TTTCTATTCG GTGTTTAAAA AAGATTATGA CACCACGCCG AAAGAGTATC GGGAGCGGTA CAGCGAGGTG CTGATTTAG
|
Protein sequence | MFEKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRASIE TIKDWLGDGV IADFDDPDIQ KMLVDVDVPI VGVGGSYHSP EKYPPVHYIA TDNYALVESA FLHLKEKGVH RFAFYGLPAS SGKRWAMERE HAFCQIVAQE KYRGVVYQGL ETAPENWQHA QNRLADWLQT LPPQTGIIAV TDARARHVLQ VCDHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGTR QMGYQAAKLL HRLLDNETLP LQRLLVPPVR VVERRSTDYR SLNDPAVIQA MHYIRNHACK GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAVIHAEKLE KARSLLVSTS LSINEISQMC GYPSLQYFYS VFKKDYDTTP KEYRERYSEV LI
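The sequences above are displayed in space-separated groups of ten characters, so the spaces have to be removed before the sequences are used elsewhere. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and only the first three groups of the gene sequence are used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into display groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three display groups of the gene sequence above (sample only).
sample = "ATGTTTGAGA AGCGTCACCG CATTACGTTG"
dna = clean_sequence(sample)

print(dna)       # ATGTTTGAGAAGCGTCACCGCATTACGTTG
print(len(dna))  # 30
print(f"GC fraction of sample: {gc_fraction(dna):.2f}")
```

Applying the same two helpers to the complete gene sequence gives a value that can be compared with the GC content field of the record.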