Gene EcHS_A1639 details

Gene Information

Locus tag: EcHS_A1639
Symbol:
ID: 5593271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1662058
End bp: 1664703
Gene Length: 2646 bp
Protein Length: 881 aa
Translation table: 11
GC content: 56%
IMG OID: 640920787
Product: fibronectin type III domain-containing protein
Protein accession: YP_001458343
Protein GI: 157161025
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
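
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that adds no residue; the variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1662058
end_bp = 1664703

# An inclusive span counts both endpoints.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 2646 bp
print(protein_length, "aa")  # 881 aa
```

Both values agree with the Gene Length and Protein Length fields.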


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value: 0.994447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACCC ATCCGCGCTA CGGCATGGGG AAACGTCTTG GTGCGGCAGA TGTGGATAAA 
TGGGCGCTGT ATGTCATCGG CCAGTACTGC GACCAGTCGG TGCCGGACGG TTTTGGCGGC
ACGGAGCCGC GCATCACCTG TAATGCCTAC CTGACCACGC AGCGTAAGGC GTGGGATGTG
CTCAGTGATT TCTGCTCGGC GATGCGCTGT ATGCCGGTAT GGAACGGGCA GACGCTGACG
TTCGTGCAGG ACCGACCATC AGATAAGGTG TGGACCTATA ACCGCAGTAA TGTGGTGATG
CCGGATGATG GCGCGCCGTT CCGCTACAGC TTCAGCGCCC TGAAGGACCG CCATAATGCC
GTTGAGGTGA ACTGGATTGA CCCGAACAAC GGCTGGGAGA CGGCGACAGA GCTTGTTGAA
GATACGCAGG CCATTGCCCG TTACGGTCGT AATGTCACGA AGATGGATGC CTTTGGCTGT
ACCTGCCGGG GGCAGGCACA CCGCGCCGGG CTGTGGCTGA TTAAAACGGA ACTGCTGGAG
ACGCAGACCG TGGATTTCAG CGTGGGTGCT GAAGGGCTTC GCCATGTACC GGGCGATGTC
ATTGAAATCT GCGATGATGA CTATGCCGGT ATCAGCACCG GCGGGCGCGT GCTGGCGGTA
AACAGCCAGA CCCGGACGCT GACGCTCGAC CGTGAAATCA CGCTGCCATC TTCCGGCACC
ACGCTGATAA GCCTGGTTGA CGGGCAGGGG AGTCCGGTCA GCGTGGAGGT TCAGTCCGTC
ACCGACGGCG TGAAGGTAAA AGTGAGCCGT GTTCCTGACG GTGTTGCTGA ATACAGCGTA
TGGGGGCTGA AGCTGCCGAC GCTGCGCCAG CGACTGTTCC GCTGCGTGAG TATCCGTGAG
AACGACGACG GCACGTATGC CATCACCGCC GTGCAGCATG TGCCGGAAAA AGAGGCCATC
GTGGATAACG GGGCGCACTT TGACGGCGAC CAGAGCGGCA CGGTGAATGG TGTCACGCCG
CCAGCGGTGC AGCACCTGAC TGCCGAAGTC ACCGCAGACA GCGGGGAGTA TCAGGTACTG
GCCCGCTGGG ACACGCCGAA GGTGGTGAAG GGGGTGAGCT TTATGCTTCG CCTGACCGTG
GCAGCGGACG ACGGCAGTGA GCGGCTGGTC AGCACGGCCC GGACGACAGA AACCACATAC
CGCTTCACGC AACTGGCGCT GGGACGGTAC ACGCTGACAG TCCGGGCGGT AAATGCGTGG
GGACAGCAGG GCGATCCGGC GTCGGTATCG TTCCGGATTG CCGCACCGGT AGCACCGTCG
CGGATTGAGC TGACGCCGGG CTATTTTCAG ATAACTGCCA CGCCGCATCT TGCGGTTTAT
GATCCGACGG TACAGTTTGA GTTCTGGTTC TCGGAAACGC GGATTACCGA TATCAGGCAG
GTTGAAACCA CAGCCCGCTA CCTTGGCGCG GGGCTGTACT GGATAGCCGC CAGTATCAAT
ATCAAACCGG GCCATAATTA TTATTTTTAC GTTCGCAGTG TGAACACCGT TGGCAAATCG
GCATTCGTGG AGGCTGTTGG TCAGCCGAGT GATGACGCAT CCGGCTATCT GGATTTTTTC
AAAGGCGAGA TAGGGAAAAC CCATCTGGCT CAGGAGCTGT GGACGCAGAT TGATAACGGT
CAGCTTGCGC CTGACCTGGC TGAAATCAGG ACATCCATTA CGGATGTCAG CAATGAAATC
ACACAGACCG TCAATAAGAA ACTGGAAGAC CAGAGTGCAG CGATCCAGCA GATACAGAAG
GTTCAGGTTG ATACAAATAA TAACCTGAAC AGCATGTGGG CCGTGAAACT GCAGCAGATG
CAGGACGGAC GCCTTTATAT TGCGGGTATC GGTGCCGGTA TTGAGAATAC GCCAGCAGGA
ATGCAGAGTC AGGTGCTGCT GGCGGCAGAC AGGATTGCGA TGATTAATCC TGCGAATGGC
AACACAAAGC CGATGTTTGT TGGTCAGGGT GATCAGATAT TCATGAACGA CGTGTTCCTG
AAACGCCTGA CGGCTCCCAC CATTACCAGC GGCGGTAATC CTCCTGCATT TTCCCTTACA
CCGGACGGGC GACTGACGGC GAAAAATGCG GATATCAGTG GCAGTGTGAA TGCGAACGCC
GGGACGCTCA ACAATGTCAC GATAAATGAA AACTGTCGGG TTCTGGGAAA ACTGTCTGCG
AACCAGATTG AAGGCGATCT CGTTAAAACA GTGGGCAAAG CTTTCCCCCG GGACTCCCGT
GCACCGGAGC GGTGGCCATC AGGGACCATT ACCGTCAGGG TTTATGACGA TCAGCCGTTT
GACCGGCAGA TTGTTATTCC CACGGTGGCG TTTCGTGGCG CTAAACATGA GCGGGAGAAT
AACGATATTT ATTCGTCATG CCGCCTGATA GTGAAGAAAA ACGGTGCTGA AATTTATAAC
CGTACCGCGC TGGATAATAC GCTGGTTTAT ACAGGTGTTA TTGATATGCC TGCTGGTCGC
GGTCACATGA CGCTGGAGTT TTCGGTATCA GCGTGGCTGG TAAATGACTG GTATCCCACA
GCCAGTATCA GTGATTTGCT GGTTGTGGTG ATGAAGAAAT CCACAGCAGG TATCAGTATC
AGCTGA
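
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be stripped before any computation. As a minimal sketch, the GC content field could be recomputed as below. The function name gc_percent is hypothetical, and the example runs on the first 10-base block only; paste the full sequence in to check the reported value.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines used for block formatting.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First 10-base block of the gene sequence above.
first_block = "ATGCTGACCC"
print(gc_percent(first_block))  # 60.0
```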
 
Protein sequence
MLTHPRYGMG KRLGAADVDK WALYVIGQYC DQSVPDGFGG TEPRITCNAY LTTQRKAWDV 
LSDFCSAMRC MPVWNGQTLT FVQDRPSDKV WTYNRSNVVM PDDGAPFRYS FSALKDRHNA
VEVNWIDPNN GWETATELVE DTQAIARYGR NVTKMDAFGC TCRGQAHRAG LWLIKTELLE
TQTVDFSVGA EGLRHVPGDV IEICDDDYAG ISTGGRVLAV NSQTRTLTLD REITLPSSGT
TLISLVDGQG SPVSVEVQSV TDGVKVKVSR VPDGVAEYSV WGLKLPTLRQ RLFRCVSIRE
NDDGTYAITA VQHVPEKEAI VDNGAHFDGD QSGTVNGVTP PAVQHLTAEV TADSGEYQVL
ARWDTPKVVK GVSFMLRLTV AADDGSERLV STARTTETTY RFTQLALGRY TLTVRAVNAW
GQQGDPASVS FRIAAPVAPS RIELTPGYFQ ITATPHLAVY DPTVQFEFWF SETRITDIRQ
VETTARYLGA GLYWIAASIN IKPGHNYYFY VRSVNTVGKS AFVEAVGQPS DDASGYLDFF
KGEIGKTHLA QELWTQIDNG QLAPDLAEIR TSITDVSNEI TQTVNKKLED QSAAIQQIQK
VQVDTNNNLN SMWAVKLQQM QDGRLYIAGI GAGIENTPAG MQSQVLLAAD RIAMINPANG
NTKPMFVGQG DQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGRLTAKNA DISGSVNANA
GTLNNVTINE NCRVLGKLSA NQIEGDLVKT VGKAFPRDSR APERWPSGTI TVRVYDDQPF
DRQIVIPTVA FRGAKHEREN NDIYSSCRLI VKKNGAEIYN RTALDNTLVY TGVIDMPAGR
GHMTLEFSVS AWLVNDWYPT ASISDLLVVV MKKSTAGISI S
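
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string, stopping at the first stop codon."""
    # Remove the spaces and newlines used for block formatting.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGCTGACCC ATCCGCGCTA CGGCATGGGG"))  # MLTHPRYGMG
```

The output matches the first block of the protein sequence, and the final six bases, AGCTGA, translate to a serine followed by a stop, which matches the last residue shown.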