Gene Ent638_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2023 
Symbol 
ID5113439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2196518 
End bp2197534 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID640492211 
ProductLacI family transcription regulator 
Protein accessionYP_001176750 
Protein GI146311676 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID[TIGR02417] D-fructose-responsive transcription factor 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.072894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAAAAA CAAAACGCAT CACCATTAAA GACGTAGCAG AACTGGCGGG CGTATCGAAA 
GCGACCGCCA GTCTGGTTTT GAATGGTCGC AGCAAAGAAT TACGTGTCGC AGAAGAAACG
CGCGATCGCG TGCTTGCCAT TGCAAAACAG CATCACTACC AGCCCAGTAT TCATGCGCGA
TCGCTGCGGG ATAATCGTAG CCATACCATC GGACTGGTCG TGCCAGAAAT CACCAACTAC
GGCTTTGCTG ATTTTTCACA TGAGCTGGAG ACGTTGTGCC GCGAAGCTGG CGTCCAGTTG
CTTATCTCCT GTACGGACGA AAATCCGGGG CAAGAAACCG TGGTGGTCAA CAATATGGTT
TCCCGCCAGG TCGATGGCTT GATTGTTGCC TCGAGCATGT TGAATGACAC CGACTACCAA
AAGCTGAGCG AACAACTGCC CATCGTGCTG TTTGACCGGC ATATGAATGA CAGTTCGTTA
CCGCAGGTGA TTACCGACTC CATTACGCCA ACCCGTGAAC TCGTCGCCGA CATCGCTCGG
CAGCATCCGG ATGAAATCTA TTTTCTCGGA GGGCAGCCGC GGCTTTCGCC CACGCGCGAT
CGCTTAGAAG GATTCAAACA GGGGTTAGCG CAGGCGGGCG TCACGTTGCG TCCGGAATGG
ATTATTCACG GGAACTATCA TCCAAGTTCC GGCTACGAGA TGTTCGCCGC GCTGTGCGCG
CAGTTGGGGC GGCCACCGAA GGCCGTTTTC ACTGCTGCCT GTGGCTTACT CGAAGGGGTG
TTGCGCTACA TGGGCCAGCA CAATCTGTTG CAAAGTGATA TGCGACTGGC CAGTTTTGAC
GATCACTATC TTTATGATTC TCTGGCCATC CCGATTGATA CGATACGACA GGATAATCGC
CAACTGGCGT GGCACTGCTT TGATTTGATT GGCAAGTTGA TTGAAGGGGA CGTTCCTGAT
CCGCTGCAAC GCAAGCTCGA TGCAACGCTT CAACGGCGGC ATAAAACGGC AGGGTGA
 
Protein sequence
MRKTKRITIK DVAELAGVSK ATASLVLNGR SKELRVAEET RDRVLAIAKQ HHYQPSIHAR 
SLRDNRSHTI GLVVPEITNY GFADFSHELE TLCREAGVQL LISCTDENPG QETVVVNNMV
SRQVDGLIVA SSMLNDTDYQ KLSEQLPIVL FDRHMNDSSL PQVITDSITP TRELVADIAR
QHPDEIYFLG GQPRLSPTRD RLEGFKQGLA QAGVTLRPEW IIHGNYHPSS GYEMFAALCA
QLGRPPKAVF TAACGLLEGV LRYMGQHNLL QSDMRLASFD DHYLYDSLAI PIDTIRQDNR
QLAWHCFDLI GKLIEGDVPD PLQRKLDATL QRRHKTAG