Gene ECH74115_3624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3624 
SymbolnupC 
ID6970185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3345291 
End bp3346493 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content49% 
IMG OID643387419 
Productnucleoside transporter NupC 
Protein accessionYP_002271878 
Protein GI209400703 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00341076 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.575462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGCG TCCTTCATTT TGTACTGGCA CTTGCCGTTG TTGCGATCCT CGCACTGCTG 
GTAAGCAGCG ACCGCAAAAA AATTCGTATC CGTTATGTTA TTCAACTGCT TGTTATCGAA
GTGTTACTGG CGTGGTTCTT CCTGAACTCC GACGTTGGTT TAGGCTTCGT GAAAGGCTTC
TCCGAAATGT TCGAAAAACT GCTCGGATTT GCCAACGAAG GGACTAACTT CGTCTTTGGT
AGCATGAATG ATCAAGGCCT GGCATTCTTC TTCCTGAAAG TGCTGTGCCC AATCGTCTTT
ATCTCTGCAC TGATCGGTAT TCTCCAGCAC ATTCGCGTGT TGCCGGTGAT TATCCGCGCA
ATTGGTTTCC TGCTCTCCAA AGTCAACGGC ATGGGCAAAC TGGAATCCTT TAACGCCGTC
AGCTCCCTGA TTCTGGGTCA GTCTGAAAAC TTTATTGCCT ATAAAGATAT CCTTGGCAAA
ATCTCCCGTA ACCGTATGTA CACCATGGCA GCAACGGCGA TGTCCACCGT TTCTATGTCC
ATCGTTGGTG CATACATGAC CATGCTGGAA CCAAAATACG TCGTTGCAGC GCTGGTACTG
AACATGTTCA GCACCTTTAT CGTGCTGTCG CTGATCAACC CTTACCGTGT TGATGCCTGT
GAAGAAAACA TTCAGATGTC CAACCTGCAC GAAGGTCAGA GCTTCTTCGA AATGCTGGGT
GAATACATTC TGGCAGGTTT CAAAGTTGCC ATTATCGTTG CCGCAATGCT GATTGGCTTT
ATCGCCCTGA TCGCCGCGCT GAACGCACTG TTTGCTACCG TGACTGGCTG GATTGGCTAC
AGCATCTCCT TCCAGGGCAT CCTGGGCTAC ATCTTCTATC CGATTGCATG GGTGATGGGT
GTTCCTTCCA GTGAAGCACT GCAAGTGGGC AGTATCATGG CGACCAAACT GGTTTCCAAC
GAGTTCGTTG CGATGATGGA TCTTCAGAAA ATTGCCTCTA CACTCTCTCC GCGTGCTGAA
GGCATCATCT CTGTGTTCCT GGTTTCCTTC GCTAACTTCT CTTCAATCGG GATTATCGCA
GGTGCAGTTA AAGGCCTGAA TGAAGAGCAA GGTAACGTGG TTTCTCGCTT CGGTCTGAAG
CTGGTTTACG GCTCTACCCT GGTGAGTGTG CTGTCTGCGT CAATCGCAGC ACTGGTGCTG
TAA
 
Protein sequence
MDRVLHFVLA LAVVAILALL VSSDRKKIRI RYVIQLLVIE VLLAWFFLNS DVGLGFVKGF 
SEMFEKLLGF ANEGTNFVFG SMNDQGLAFF FLKVLCPIVF ISALIGILQH IRVLPVIIRA
IGFLLSKVNG MGKLESFNAV SSLILGQSEN FIAYKDILGK ISRNRMYTMA ATAMSTVSMS
IVGAYMTMLE PKYVVAALVL NMFSTFIVLS LINPYRVDAC EENIQMSNLH EGQSFFEMLG
EYILAGFKVA IIVAAMLIGF IALIAALNAL FATVTGWIGY SISFQGILGY IFYPIAWVMG
VPSSEALQVG SIMATKLVSN EFVAMMDLQK IASTLSPRAE GIISVFLVSF ANFSSIGIIA
GAVKGLNEEQ GNVVSRFGLK LVYGSTLVSV LSASIAALVL