Gene Ent638_4213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4213 
Symbol 
ID5110327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009425 
Strand
Start bp23246 
End bp24544 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content48% 
IMG OID640480830 
ProductGntR family transcriptional regulator 
Protein accessionYP_001165492 
Protein GI146284539 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0990086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTA ACGGTAAAAC AGCGGGTGAT ATTTTCGAAC AGATTCGCGC AATGATTCAG 
GCTGGCGAAC TTGTTCCCGG CATGATGCTT CCCACGCTTC GTGAACTTGC AGGTGAATTG
GGTATCAATC GTAATACGGT TGCCCTGGCT TATAAGCGAC TTACGGACGC AGGTTTTGTC
ATATCCAGAG GCCGAAATGG AACTATTGTA CGTGAACACA TTATGCTCAC TGATATTGAA
GGTAGTGCGT CTAACCTTGT GCTTCGTGAC CTCGCCAGCG GCAACCCTGC AGTCACACTT
CTGCCTTCAA TGAGCCAACT TGCCGTGCAT ATCCAAAACA CGCCGGGTCT GTACGGTGAA
ACCGTGATCC GCCCTGAACT TGAGGCTCTG GGGCTTGAAT GGTTAAAACA AGATATAGGA
TCACCTTTTG ATTTAAATCT GACTAACGGT GCTGTTGATG CTATTGAAAA GGTACTTACC
AGCTATTTGA TTGCTGGAGA TCGTGTGGCT GTTGAAGACC CTTGCTTCCT CAGTAGCATC
AGCACTCTGC GCCATAACCG CTTTCAGGTT GCTCCTGTGG AAATAGATGT CGAAGGGATG
AAAATTGAGT CGCTTTCACG GCAGCTTTCT GCTGGCGTGA AAGCGGTGAT TATTACTCCA
CGTGCCCACA ATCCTACCGG TTTTAGCCTC AGTTTTAAGC GTGCTGAAGG TATTCGTACT
CTCCTGGCTA GTCATCCTCA CGTTCTGGTG ATAGTGGATG ATCACTTTTC ATTGCTCTCA
ACACATGACT ACTACCACAT TGTTCCGGGA AATACCCGCA ACTGGGTACT CATCCGTTCC
ATGTCAAAGA GCTTGGGACC TGATTTACGT ATGGCCTTTG TCGCAAGCGA TGCAGATACC
TCACAGCGCT TGCGCCTGCG CCTGAATTCG GGGACTAACT GGGTAAGTCA TATTTTGCAG
GATATGGTTG TTGGTCATAT GCATTCATCT GGCTTCCAGA AGTCAATCTT GTCCGCACGT
GAGAGTTATT TTGAAAAAAG AGAGTTGATG GTCAATGCGC TAAAACAACA TGGCGTCAAA
GTTCCAGACC ATCATGATGG CCTCAATGTG TGGATTCCGT TGACACAGAA CAGCGCTCCA
ATTGTTATGC AAATGGCCCA ACGGGGTTGG CTTATTCGGG GTGGGGAAGG TTTCAACCTG
AACAATTCTG GCTCCGGGGT GCGAATCACC ATTTCAGATC TTGATGCAAC GGAAACTAAA
CTAATCGCGA AATCTCTGGC TGAGATACTG ACACAATAG
 
Protein sequence
MNINGKTAGD IFEQIRAMIQ AGELVPGMML PTLRELAGEL GINRNTVALA YKRLTDAGFV 
ISRGRNGTIV REHIMLTDIE GSASNLVLRD LASGNPAVTL LPSMSQLAVH IQNTPGLYGE
TVIRPELEAL GLEWLKQDIG SPFDLNLTNG AVDAIEKVLT SYLIAGDRVA VEDPCFLSSI
STLRHNRFQV APVEIDVEGM KIESLSRQLS AGVKAVIITP RAHNPTGFSL SFKRAEGIRT
LLASHPHVLV IVDDHFSLLS THDYYHIVPG NTRNWVLIRS MSKSLGPDLR MAFVASDADT
SQRLRLRLNS GTNWVSHILQ DMVVGHMHSS GFQKSILSAR ESYFEKRELM VNALKQHGVK
VPDHHDGLNV WIPLTQNSAP IVMQMAQRGW LIRGGEGFNL NNSGSGVRIT ISDLDATETK
LIAKSLAEIL TQ