Gene EcE24377A_4621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4621 
Symbol 
ID5590413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4620866 
End bp4622158 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content34% 
IMG OID640928237 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001465569 
Protein GI157158802 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCTCTCTTT GCCAGATTAA CGATAACCAC 
TCTCTCACAA GTTCATCACA TACAAAGAAA ACAAAATCAT ATAATTACAG CAAACATCAT
AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA
GTGATAGGTT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC
ACGAAGGAAA CGGACAGGAA AATACATAAA CATTTTATAG ATATAGCTTT ATATGCAAAT
AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA GTAAAGAAAA GGAAGTATCC
ATTAAGGTGG TATATTTTAT AAATAATGTC GCCGCCCATA ATAATACTAT CGAAATTCCA
CAGACAGTAA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA
GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTCCAG
GACTGTAATA TGTATAAAAC GAATTTTTAT TACGCCATAA TGGAAAAAAT ACTTTTTGAT
AATTGTATTC TCGATGACTC AAATTTCGCT CAGATAAAAA TGGCCGACGG AACTCTAAAT
GCATGCTCCG CTATGCATGT TCAATTCTAC AATGCAGCAA TGAATAGAGC CAATATTAAA
AACACCTTCC TTGACTATTC AAATTTTTAT ATGGCGTACA TGGCTGAGGT AAATCTTTAT
AAAGTAATAG CGCCATATGT TAATTTATTT AAAGCCGACC TTAGTTTCTC TAAACTCGAT
TTAATTAACT TTGAACATGC TGATCTGTCT CGCGTCAATC TGAACAAAGC AATCCTCCAG
AATATAAACT TAATTGATAG CAAACTCTTT TGTACGTGGC TGACAAATAC ATTCCTCGAA
ATGGTTATAT GTACCGGCTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAGC
AACTGCCATT TCAACTGTTC TATTTTAACA AAAGCCTGTA TGTTTAATAC CCGTCTCTAT
CGGGTTAATT TTGATGAGGC TAGCGTCCAG GGAATGGGCA TTTCCATTCT CCGTGGGGAG
GAAAATATCC CCATTGATAG TGATACCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT
TGTACCTCTC ATACTGGCAT GTCACAAACT GAGGATAATA TTAATGCAGT CGCTATGAAG
ATTACTGCAG ATATTATGCA ACACGCAGAT TGA
 
Protein sequence
MRYNGLNNMF FSLCQINDNH SLTSSSHTKK TKSYNYSKHH KNTLIDNKAL SLFKMDDHEK 
VIGLIQKMKR IYDSLPSGKI TKETDRKIHK HFIDIALYAN NKCDDRITRR VYLSKEKEVS
IKVVYFINNV AAHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ
DCNMYKTNFY YAIMEKILFD NCILDDSNFA QIKMADGTLN ACSAMHVQFY NAAMNRANIK
NTFLDYSNFY MAYMAEVNLY KVIAPYVNLF KADLSFSKLD LINFEHADLS RVNLNKAILQ
NINLIDSKLF CTWLTNTFLE MVICTGSNMA NVNFNNANLS NCHFNCSILT KACMFNTRLY
RVNFDEASVQ GMGISILRGE ENIPIDSDTL VTLQKFFEED CTSHTGMSQT EDNINAVAMK
ITADIMQHAD