Gene EcHS_A4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4029 
Symbol 
ID5591742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4020945 
End bp4022270 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content46% 
IMG OID640923133 
Producthypothetical protein 
Protein accessionYP_001460599 
Protein GI157163281 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGTC AAATAATCAC AGTTGCACTC ATCCTGTTGG GGCACATATT TCTGACGCCT 
GCTGTACAGG CCATAGGGTT CGACTATTAC AATGATCACG GTGTTATGTC TTACGGTAAA
GGTTATGGGG AGGATGAGAA AATCATTGCC CAATTTCCGA AGATGAACAG AGCCGATTTG
CGCCTTGTGA CAAATATATC CGGGGAGCGG GAGTTGGTGG AAGGCTATAT ACCCACCGAT
GAAATAAGCA TAAAAAATGA GGAATACCAC TGGGTGACAG ACGGCCGCGT TATTCTCTGG
CGTGGCAAAA TCGTTAGCAA TCCACCGGGA ACCCCCACTG TCGATATTGC CAGCTTTCAG
GCTATGGGCC GCTTCGCGGT CGATAAATAT AGCCTCTATT TTGACGGACA GCGCACCGAA
AGTAATAGCG GCGCGTCACG TGTTGATCTG GCGACACTAA AAGCTATCGA AGGTAACTCT
ACCACGCTGA TGGATAGTAA AAATCTCTAC TTGTCCGGTC GACGTCAGGG GAGCAGCAGT
GATGTTACTG TATTAGAAAA AAGATGGTGG GGTATTAATC CACGTCTTAT GAGTGTAAAC
AGAAATTCGT ATTCCAATGA TCTGCTTATC CGCAGTGGGC AGAATATTTA TTTAAATGGC
GTTCACCTTA CGGCGAATGC AGACTCATTT GAGATAATTC GCTGGATACC TCATTCACTG
CTGGTTTTTC GCGACAATAA GGGTCTGCAT CGTTATCCCT TTGGTCAATT ATCAGGCAAA
GCGATACCCG TAGATGATGA CGTCTCTTTT GAAGTAGGGG AAAGTCGCGT TCGCTGGCGT
AAACAGCTCA CGCCCGACCG TCAGTGGAGC AAGTGGATAG ACCTACCAGG TATTGAACCT
GAACAATTTC ATCTGATTAC TGGCAATATT GCGCAATATA AAGATCGGCT GTATGTAACA
AAATTATCGA CATTTGGTGA AGACCAGCTT GAGATAATCC CGCTGGATAC GCCAGACCTG
GTCATTGATC GCTCATTTAA TAGCGGCAAA CAGCATGCTT ACTTTATCCG CCAATTACGG
TCAAAGAGCT TGCAAATTAT TCCAGTTAAC GGTCCGCTAA CTAAAAACGA TCGCTTCGCT
TATGACGATC GCAATGTTTA TACATGGACC GATACAGAGG TAAGGATTAC GCCCTCCCCC
TGCCCGGCGA AAACTCGTGT CAGAGAGGAG AACGTACGTG AAGTTCAAAA CAGAGACATC
ATTATTCCGG TGACGGATGA ATCATGCCGG AACGCAGCAG CAGAGGTGCA AACTTTGAAG
CCCTGA
 
Protein sequence
MNGQIITVAL ILLGHIFLTP AVQAIGFDYY NDHGVMSYGK GYGEDEKIIA QFPKMNRADL 
RLVTNISGER ELVEGYIPTD EISIKNEEYH WVTDGRVILW RGKIVSNPPG TPTVDIASFQ
AMGRFAVDKY SLYFDGQRTE SNSGASRVDL ATLKAIEGNS TTLMDSKNLY LSGRRQGSSS
DVTVLEKRWW GINPRLMSVN RNSYSNDLLI RSGQNIYLNG VHLTANADSF EIIRWIPHSL
LVFRDNKGLH RYPFGQLSGK AIPVDDDVSF EVGESRVRWR KQLTPDRQWS KWIDLPGIEP
EQFHLITGNI AQYKDRLYVT KLSTFGEDQL EIIPLDTPDL VIDRSFNSGK QHAYFIRQLR
SKSLQIIPVN GPLTKNDRFA YDDRNVYTWT DTEVRITPSP CPAKTRVREE NVREVQNRDI
IIPVTDESCR NAAAEVQTLK P