Gene EcE24377A_4159 details


Gene Information

Locus tag: EcE24377A_4159
Symbol:
ID: 5587366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4147081
End bp: 4148790
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 54%
IMG OID: 640927777
Product: AsmA family protein
Protein accession: YP_001465137
Protein GI: 157159023
COG category:
COG ID:
TIGRFAM ID:
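
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4147081, 4148790)
print(length)                     # 1710
print(protein_length_aa(length))  # 569
```

Both values agree with the Gene Length and Protein Length fields of this record.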


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGTGATCGCT 
GGCCTCTATT TTCTTCTGCA AACCCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC
GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT
CATATCGTGC TGGAGAACGT CACGTTTGGT CGTGATGGTC AGCCCGCGAC CCTGGTGGCA
AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC
ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA
GCCGATCGTC TGCAACTGCG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG
AGCGCGCAGC GGGTAAATGG CGGCGTGGTT CCGTGGTCAC CAGAAGCCGG TAAAGTGCTG
GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC
ACCAATGTAC TGATTGAAGG CAGTATTGAT AATGATCGCG TTACGCTGAC TAACCTGGGT
GCCGACATCG CCCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCCGA CGGCAGCTGG
CAAGTGGAAA ACCTGCGCAT GGCGGATATA CGTCTACAAA GCGAAAAATC GCTAACCGAC
TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTGATCGAT
GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCGCAACATG
ACCTTCAGTA AAGATGACTG GCAGACACAA GAAGGCAAAC TGTCGATGAA CGCTAGAGAG
TTCATTTATG GTTCGCTGCA TTTATTTGAC CCGATTATAA ACGCGGAATT TTCCCCGCAG
GGCGTAGCGC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GTATGGTCAG AACGTCAGGG
AACTGGCTGC GTGACGGGAA AACGTTGATC CTTGATGATG CGGCAATTGC CGGGCTGGAA
TATACCTTGC CGAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC
AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC
TTCCCGTGGC AGCTCACCAC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT
CATAAATGGG GCGTCTGGAG TGGCTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT
CGTGTTGATG TTCGTCGCCC GTCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC
AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACCGCCAG TGTTTCACAA
ACGCCACAAC GTCAGACACA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG
CAACAGTGGG GATGGCCTAA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC
AGTGGCGATA TTCAGGCCAA TGTCCCGTTG AAACCTACGG TTAGCGGGCA ATTGCATGCC
GTGAACGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CTGGCATCGT TTCCAGCGGT
GAAGTTACAT CGACGGAGCC GGTGCGGTAA
 
Protein sequence
MKFIGKLLLY ILIALLVVIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS 
HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK
ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPEAGKVL GTKAQIQFSA GSLSLNDVPA
TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD
FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLRNM TFSKDDWQTQ EGKLSMNARE
FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKTLI LDDAAIAGLE
YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTTLDG YGANLTLVTD
HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ
TPQRQTHISL NGRGVPVNIL QQWGWPKLPL TGDGNIQLTA SGDIQANVPL KPTVSGQLHA
VNAAKQQVTQ TMNAGIVSSG EVTSTEPVR
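
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers written for this illustration.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence shown above.
print(translate("ATGAAATTTA TTGGGAAGCT GCTTCTCTAC"))  # MKFIGKLLLY
print(translate("GAAGTTACAT CGACGGAGCC GGTGCGGTAA"))  # EVTSTEPVR
```

The two outputs match the first and last blocks of the protein sequence in this record.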