Gene EcHS_A2508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2508 
Symbol 
ID5591925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2519428 
End bp2520573 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID640921629 
Producthypothetical protein 
Protein accessionYP_001459162 
Protein GI157161844 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.0122799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAATA ATGAAAGCAA AGGCCCGTTT GAAGGCTTAT TAGTTATCGA TATGACACAT 
GTCCTTAATG GACCTTTCGG AACTCAACTT CTTTGTAATA TGGGCGCAAG GGTAATTAAA
GTTGAGCCGC CGGGTCATGG TGATGATACC CGCACATTTG GTCCCTATGT GGATGGACAG
TCACTCTATT ACAGTTTTAT TAATCATGGC AAAGAGAGTG TGGTTCTTGA TTTAAAGAAT
GATCACGATA AAAGTATATT TATAAATATG CTCAAACAAG CTGATGTATT AGCTGAGAAT
TTTCGCCCAG GTACAATGGA AAAACTGGGG TTTTCATGGG AAACGCTTCA AGAAATCAAC
CCGCGCCTCA TATATGCTTC ATCGTCAGGT TTCGGACATA CCGGTCCGCT AAAAGATGCT
CCTGCCTACG ATACCATCAT TCAGGCAATG AGCGGGATAA TGATGGAAAC AGGATATCCT
GATGCTCCGC CAGTGCGCGT TGGTACATCT CTTGCGGATC TATGCGGCGG TGTCTATTTA
TTCAGCGGAA TAGTGAGTGC ACTTTATGGC CGTGAAAAGA GCCAGAGAGG GGCGCATGTC
GATATAGCGA TGTTTGATGC CACGCTGAGT TTTCTGGAGC ATGGTCTGAT GGCATATATC
GCGACAGGGA AGTCACCACA ACGTCTGGGA AATCGCCATC CCTACATGGC ACCTTTTGAT
GTTTTCAATA CTCAGGATAA GCCGATTACG ATTTGTTGTG GTAATGACAA GCTTTTTTCT
GCGTTATGCC AGGCACTGGA GCTTACGGAA CTGGTTAATG ATCCCCGATT TAGCAGCAAT
ATTTTACGCG TACAAAACCA GGCTATTCTT AAACAATATA TTGAGCGGAC GTTAAAAACG
CAGGCAGCTG AAGTTTGGTT AGCCAGAATA CATGAAGTTG GTGTACCCGT CGCGCCGTTA
TTAAGTGTGG CTGAGGCCAT TAAATTGCCA CAAACTCAGG CGAGAAATAT GTTGATTGAA
GCCGGGGGAA TAATGATGCC GGGTAATCCG ATAAAAATCA GCGGCTGCGC GGACCCGCAT
GTTATGCCGG GAGCGGCAAC GCTCGACCAG CATGGGGAAC AAATTCGCCA GGAGTTCTCA
TCATAA
 
Protein sequence
MTNNESKGPF EGLLVIDMTH VLNGPFGTQL LCNMGARVIK VEPPGHGDDT RTFGPYVDGQ 
SLYYSFINHG KESVVLDLKN DHDKSIFINM LKQADVLAEN FRPGTMEKLG FSWETLQEIN
PRLIYASSSG FGHTGPLKDA PAYDTIIQAM SGIMMETGYP DAPPVRVGTS LADLCGGVYL
FSGIVSALYG REKSQRGAHV DIAMFDATLS FLEHGLMAYI ATGKSPQRLG NRHPYMAPFD
VFNTQDKPIT ICCGNDKLFS ALCQALELTE LVNDPRFSSN ILRVQNQAIL KQYIERTLKT
QAAEVWLARI HEVGVPVAPL LSVAEAIKLP QTQARNMLIE AGGIMMPGNP IKISGCADPH
VMPGAATLDQ HGEQIRQEFS S