Gene EcHS_A3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3298 
Symbol 
ID5592208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3304179 
End bp3305489 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID640922416 
Producthypothetical protein 
Protein accessionYP_001459910 
Protein GI157162592 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA 
GTAAAACCGG CGTTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT
GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG
AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG
GCGCTGGGGG CGTTAGGTGG AAATGCCAAC GCCGGGCTGG AAGTGCTGAA AGACGCAACT
GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC CGTTAAGATC
CAGGAACCTT GCAATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG
GCGTGTGTCA CCATCGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACGATGGT
GTGGTGTTTA CCCAGCAGGC GTGTGTGGCA GAGGGCGAGC AAGAGTCTCC GCTGACGGTG
CTTTCCAGAA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG
ATCCGCTTTA TTCTCGATTC TGCGAAGCTA AATTGTGCGT TATCGCAGGA AGGTTTGAGC
GGTAAGTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGCGCG CGGTTTGCTG
GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG
GGCGGCGCTA CGCTTCCGGC AATGAGTAAC TCTGGCTCGG GTAACCAGGG GATCACCGCA
ACAATGCCTG TGGTGGTGGT AGCAGAACAC TTCGGAGCGG ATGATGAACG GCTGGCGCGT
GCGCTGATGC TTTCGCATTT GAGCGCAATT TACATCCATA ACCAGTTACC GCGTTTGTCT
GCACTTTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG
GATGGGCGTT ATGAAACTAT CTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC
ATGATTTGCG ATGGAGCGTC GAACAGCTGT GCGATGAAGG TTTCGACCAG TGCTTCGGCT
GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCCG TGACCGGCAA TGAAGGGATC
GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG
CAGCAAACCG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
 
Protein sequence
MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM 
KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI
QEPCNEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHDG VVFTQQACVA EGEQESPLTV
LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GKWGLHIGAT LEKQCARGLL
AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR
ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG
MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM
QQTDRQIIEI MASKAR