Gene EcHS_A3896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3896 
Symbol 
ID5591342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3891783 
End bp3893444 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content53% 
IMG OID640923004 
Producthypothetical protein 
Protein accessionYP_001460481 
Protein GI157163163 
COG category[R] General function prediction only 
COG ID[COG2985] Predicted permease 
TIGRFAM ID[TIGR01625] AspT/YidE/YbjL antiporter duplication domain 


Plasmid Coverage information

Num covering plasmid clones73 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA TAGCATTAAC GGTCAGTATT CTGGCTTTGG TGGCAGTCGT CGGTTTGTTT 
ATCGGCAACG TCAAATTTCG CGGCATAGGA TTAGGTATTG GCGGCGTGCT GTTTGGTGGG
ATCATCGTCG GCCATTTTGT TTCTCAGGCG GGAATGACAT TAAGTAGCGA TATGCTGCAT
GTTATTCAGG AATTTGGCCT GATCCTGTTC GTTTATACCA TCGGGATTCA GGTGGGTCCG
GGCTTCTTTG CCTCATTGCG CGTCTCCGGA TTACGCCTCA ACCTGTTTGC TGTTCTGATC
GTCATCATCG GTGGTCTGGT TACCGCCATC CTGCATAAAC TGTTTGATAT TCCACTGCCG
GTAGTGCTGG GGATTTTCTC CGGTGCGGTA ACCAATACGC CAGCGCTGGG GGCAGGGCAG
CAGATCTTGC GCGACCTGGG TACACCAATG GAAATGGTCG ATCAGATGGG GATGAGTTAT
GCGATGGCGT ATCCATTCGG CATTTGCGGG ATATTGTTCA CCATGTGGAT GTTGCGGGTT
ATTTTCCGCG TCAATGTCGA GACAGAAGCC CAGCAGCACG AGTCTTCACG CACCAATGGC
GGCGCGCTGA TCAGGACTAT CAATATTCGC GTTGAGAACC CTAACCTGCA TGATTTAGCC
ATTAAAGATG TACCTATTCT CAACGGCGAC AAAATTATCT GCTCGCGTCT GAAACGTGAA
GAAACCCTAA AAGTTCCTTC GCCAGATACC ATTATCCAAC TGGGCGATTT GCTGCATCTG
GTGGGGCAGC CAGCGGATTT ACATAATGCG CAACTGGTGA TTGGTCAGGA GGTCGATACC
TCGCTGTCTA CGAAAGGCAC TGATTTGCGC GTCGAGCGTG TGGTGGTCAC CAATGAAAAC
GTGCTCGGAA AACGTATTCG CGACCTGCAC TTTAAAGAAC GCTATGACGT TGTTATCTCG
CGCCTGAACC GTGCCGGGGT CGAACTGGTC GCCAGTGGCG ATATCAGCCT GCAGTTCGGC
GATATTCTCA ACCTGGTGGG GCGTCCGTCC GCAATTGATG CCGTTGCCAA TGTGCTGGGG
AATGCGCAGC AAAAACTGCA ACAGGTTCAG ATGTTGCCGG TGTTTATTGG TATCGGGCTT
GGCGTATTGT TAGGCTCTAT TCCCGTCTTT GTGCCGGGAT TCCCGGCCGC GTTGAAACTG
GGGCTGGCGG GCGGTCCGCT GATTATGGCG TTGATCCTCG GGCGTATCGG CAGTATTGGC
AAGCTGTACT GGTTTATGCC GCCAAGTGCC AACCTCGCGC TGCGGGAGCT GGGGATCGTG
CTGTTCCTCT CGGTCGTTGG TCTGAAATCT GGTGGGGATT TTGTGAATAC CCTGGTCAAT
GGCGAAGGGC TAAGCTGGAT TGGTTATGGT GCCCTGATCA CCGCCGTTCC GCTGATTACT
GTTGGTATTC TGGCGCGGAT GTTAGCCAAA ATGAATTACC TGACCATGTG CGGGATGCTG
GCTGGCTCCA TGACCGATCC ACCGGCGCTG GCATTTGCTA ATAATCTTCA TCCAACCAGC
GGTGCAGCGG CGCTCTCTTA CGCCACTGTC TATCCGTTAG TGATGTTCCT GCGCATTATC
ACCCCCCAAT TACTGGCGGT GCTCTTCTGG AGTATCGGTT AA
 
Protein sequence
MSDIALTVSI LALVAVVGLF IGNVKFRGIG LGIGGVLFGG IIVGHFVSQA GMTLSSDMLH 
VIQEFGLILF VYTIGIQVGP GFFASLRVSG LRLNLFAVLI VIIGGLVTAI LHKLFDIPLP
VVLGIFSGAV TNTPALGAGQ QILRDLGTPM EMVDQMGMSY AMAYPFGICG ILFTMWMLRV
IFRVNVETEA QQHESSRTNG GALIRTINIR VENPNLHDLA IKDVPILNGD KIICSRLKRE
ETLKVPSPDT IIQLGDLLHL VGQPADLHNA QLVIGQEVDT SLSTKGTDLR VERVVVTNEN
VLGKRIRDLH FKERYDVVIS RLNRAGVELV ASGDISLQFG DILNLVGRPS AIDAVANVLG
NAQQKLQQVQ MLPVFIGIGL GVLLGSIPVF VPGFPAALKL GLAGGPLIMA LILGRIGSIG
KLYWFMPPSA NLALRELGIV LFLSVVGLKS GGDFVNTLVN GEGLSWIGYG ALITAVPLIT
VGILARMLAK MNYLTMCGML AGSMTDPPAL AFANNLHPTS GAAALSYATV YPLVMFLRII
TPQLLAVLFW SIG