Gene EcHS_A0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0940 
Symbol 
ID5593844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp938781 
End bp940532 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content56% 
IMG OID640920110 
Producttail fiber domain-containing protein 
Protein accessionYP_001457677 
Protein GI157160359 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC 
GCCGCGCTCG GTGTGCCGTT AAAAATTACC CATATGGCGG TCGGCGATGG CGGCGGAACA
TTACCAACGC CGGACGCAAA GCAGACAGCA CTGGTAAATG AGAAACGCCG GGCTGCGCTG
AATATGCTCT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAACA GGTGATCCCT
GAAAACGAGG GCGGTTGGTG GATACGTGAA GTGGGCCTGT TTGATGAGTC CGGGGCATTG
ATTGCCGTGG GCAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG CAGCGGGCGT
ACCCAGACCG TGCGCATGGT GCTGATTACC AGCAGTACGG ACAATATCAC CCTGAAAATC
GACCCTGCTG TAGTGCTGGC AACCCGCAAG TATGTGGATG ACAAAATATC AGAGCACGAA
CAGTCACGAC GTCACCCGGA CGCCTCGCTG ACCGCAAAGG GTTTTACTCA GTTAAGCAGT
GCGACCAACA GTGAATCCGA AATACTGGCC GCAACACCGA AGGCTGTGAA GGCGGCATAT
GATCTTGCAG CAGGTAAAGC ATCCGCCAGT CACACACACC CGTGGAATCA GATAACGGAT
GTGCCTGCAG CTTCACTGAC GGTAAAAGGC ACCGTGCAAC TCAGCAGCGC CACTAACAGC
ACGTCAGAAA CGCAGGCTGC CACACCAAAG GCAGTGAAGG CGGCATATGA CCTTGCAGCA
GGTAAGGCAC CTGTCAGTCA CACGCACCCG TGGAGCCAGA TAACAGATGT GCCTGCAGCT
TCACTGACGG TAAAAGGCAC CGTGCAACTC AGCAGCGCTA CTAACAGCAC GTCAGAAACG
CAGGCTGCCA CACCAAAAGC CGTGAAGGCT GTATATGACC TTGCCAATGG AAAACAACCT
GCCGACGCCA CACTGACCGC ACTGGCAGGC CTTGCCACTG CGGCAGACAA ACTTCCGTAT
TTTACGGGGA ATGATACAGC CAGCCTGACA ACCCTGACTA ACGTTGGACG GAATATTCTG
GATAAAGCAA GCACACAGGC GGTTATTCAA TATCTTGGTC TGAGCGATGC AAGTGGATAC
GTTGGACGCT GGCTGAATAC CCAGGTTTTC ACCTCATCAG GTACGTACAC CCCGACGCCA
GGAACAAAAC GGATTAGGGT CACAATAACG GGCGGCGGTG GCGGAGGGGG CGGCTGCAAG
GCTATATCCA ATAATGAAAC GTTTTTCGGT GCTGGCGGCG GGGCAGGTGG GACAGTAATC
ACCACGCTGA TCCTGACGAA GGATAGTTAT CCTGTCACTA TCGGCGCAGG TGGGGCCGGC
GGCGTTAGTG CGACGAACGG CCTCAAGGGC GGTGATAGCT CGTTCGGATC GGTAATAGCC
CCTGGTGGTG AAGGTGGTGG AAAATCAGGA GTCACAAACA CGAACGGTGG TAACGGCGGT
GTGCCAAGTA CTGGCGGTAT CAACATCATT GGTGGAAATG GAGGCGACGG TCAGTCCGGA
AATATCGGCG TCAGCGGTGA AGGCGGAACA TCGCACTGGG GTGGCGGTGG ACGCGCAGGC
GCTGGCGGTG GTGTTAGTGG TAAGGCATAT GGTTCAGGTG GCGGTGGCGC ATACGATGCC
GGTTATAGCG GAACCAGTAT GACAGGCGGG AAAGGTGCCG CTGGGATTTG TATTATCGAG
GAGTTTGCAT AA
 
Protein sequence
MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGT LPTPDAKQTA LVNEKRRAAL 
NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR
TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKISEHE QSRRHPDASL TAKGFTQLSS
ATNSESEILA ATPKAVKAAY DLAAGKASAS HTHPWNQITD VPAASLTVKG TVQLSSATNS
TSETQAATPK AVKAAYDLAA GKAPVSHTHP WSQITDVPAA SLTVKGTVQL SSATNSTSET
QAATPKAVKA VYDLANGKQP ADATLTALAG LATAADKLPY FTGNDTASLT TLTNVGRNIL
DKASTQAVIQ YLGLSDASGY VGRWLNTQVF TSSGTYTPTP GTKRIRVTIT GGGGGGGGCK
AISNNETFFG AGGGAGGTVI TTLILTKDSY PVTIGAGGAG GVSATNGLKG GDSSFGSVIA
PGGEGGGKSG VTNTNGGNGG VPSTGGINII GGNGGDGQSG NIGVSGEGGT SHWGGGGRAG
AGGGVSGKAY GSGGGGAYDA GYSGTSMTGG KGAAGICIIE EFA