Gene EcHS_A1580 details

Gene Information

Locus tag: EcHS_A1580
Symbol:
ID: 5593894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1592070
End bp: 1594442
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 46%
IMG OID: 640920733
Product: TonB-dependent receptor
Protein accession: YP_001458289
Protein GI: 157160971
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
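
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1592070
END_BP = 1594442
GENE_LENGTH_BP = 2373
PROTEIN_LENGTH_AA = 790


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2373
print(expected_protein_length(GENE_LENGTH_BP))  # 790
```

Both results agree with the Gene Length and Protein Length fields.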


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC 
GATGACAAAA ACATGTACAT GCATTTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT
GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG
AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA
AGTACCTCGT TGAACCAGGG AGATATTCGC CCGGAGAAAA TCTCTATTCA CGGTGCGTCG
CCCTACCAGA ATGCCTATTT GATTGATGGT ATTAGTGCCA CTAATAACCT GAACCCAGCG
AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT
GATGTCAGCT TACTGGACTA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTCGGT
CGTTTCAATG GCGGGGTAAT TGATGCAAAG ATCAAACGCT TCAACGCTGA TGATAGCAGC
GTGAAACTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CGATGAGAAT
AACAAGAGCG CATTTAATCA AGGTTCTTCA GGAAGTACTT ATTACTCTCC AGATTTTAAA
AAGAACTTTT ATACCTTGTC GTTTAATCAG GAACTCGCTG ATAACTTTGG CGTTACCGCC
GGTTTATCGC GCCGCCAGTC TGATATCACC CGTGCGGATT ATGTTTCGAA TGACGGCATT
GTCGCCGGTC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG
TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT
TATAATACCA GCACCTTCCC GCAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTCTG
GCATGGGATA TGGATACGCA GCTCGCATGG GCCAAACTAC GTACCACCGT TGGTTGGGAT
CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA
TATGGTGATA TTACAGGGCG TTGCACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA
GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCGCCGTGGG TAATGTTTCG
CATCAAACCT ACTTCGGCGC GGAATACATC TATTCCGATG CATGGACTGA ACGCCATAAC
CAGTCTGAAT CCTATGTGAT TAATGCTGCC GGAAAGAAAA CTAACCATAC CATTTACCAT
AAAGGTAAAG GCAGCCTGGG AATTGACAAC TACACACTGT ATATGGCGGA TCGCATTAGC
TGGCGTAATG TGTCATTAAT GCCCGGCGTG CGGTATGACT ATGACAACTA TCTGTCAAAC
CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG
ATTACCGCAG GTTATAACCG TTACTATGGT GGGAATATTC TTGATATGGG ATTACGTGAT
ATCCGCAATA GCTGGACGGA ATCGGTATCA GGCAATAAAA CCCTGACGCG TTATCAGGAT
TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGATTGC AGCAAAAAAT CGGTAAGAAC
GTTATTGCGC GCGCAAACTA TGTTTACCGT GAAGCGCATG ATCAAATCAG CAAAAGCAGT
CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGATGG CAAAACCAAA
ACGCATTCAT TCAACCTCAG TTTTGAGCTG GCCGAACCCC TGCATATCCG CCAGGTAGAT
ATTAACCCAC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT
GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTGGTCTCT
TACGATAGCG TTCCAGTGGC AGATTTTAAT AACCCATTAA AGATCTCCTT AAACATGGAT
TTCACGCATC AACCGAGCGG GTTAGTGTGG GCGAATACGC TGGCCTGGCA AGAAGCGCGT
AAAGCTCGCA TTATCCTGGG TAAGGCGAAT GCGCAATACA TCAGCGAATA TTCAGATTAC
AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG
ACGCCACAAT TTCTGCAACA ACAAAACCTG ACGATCAGTG CCGATATTCT CAATGTACTG
GATAGCAAAA CCGCTGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT
ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
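
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names) strip the whitespace and compute the GC fraction of whatever sequence text is passed in; only the first line of the record is used here as sample input, so the value it yields applies to that line alone and not to the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a whitespace-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full sequence text to the same two functions gives a figure that can be compared with the GC content field.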
 
Protein sequence
MKRVLIPGVI LCGADVAQAV DDKNMYMHFF EEMTVYAPVP VPVNGNTHYT SESIERLPTG 
NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA
NESDASSATN ISGMSQGYYL DVSLLDYVTL YDSFVPVEFG RFNGGVIDAK IKRFNADDSS
VKLGYRTTRS DWLTSHIDEN NKSAFNQGSS GSTYYSPDFK KNFYTLSFNQ ELADNFGVTA
GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD
YNTSTFPQSD REMGNKSYGL AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT
YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFAVGNVS HQTYFGAEYI YSDAWTERHN
QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADRIS WRNVSLMPGV RYDYDNYLSN
HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQD
LKTPYNDELA MGLQQKIGKN VIARANYVYR EAHDQISKSS RTDSATKTTI TEYNNDGKTK
THSFNLSFEL AEPLHIRQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS
YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLAWQEAR KARIILGKAN AQYISEYSDY
KQYVDEKLDS SLTWDTRLSW TPQFLQQQNL TISADILNVL DSKTAVDTTN TGVATYASGR
TFWLDVSMKF
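
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce that by hand, assuming the codon-to-amino-acid assignments of table 11 match the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative only.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a (possibly whitespace-blocked) CDS, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAAGCGAG TTCTTATTCC TGGCGTCATT"))  # MKRVLIPGVI
```

The output matches the first 10 residues of the protein sequence listed in this record.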