Gene EcSMS35_3020 details

Gene Information

Locus tag: EcSMS35_3020
Symbol:
ID: 6143151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3110039
End bp: 3111616
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 49%
IMG OID: 641617889
Product: putative xanthine permease
Protein accession: YP_001745040
Protein GI: 170679688
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2233] Xanthine/uracil permeases
TIGRFAM ID: [TIGR00801] uracil-xanthine permease; [TIGR03173] xanthine permease
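
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3110039
end_bp = 3111616

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1578 bp) and Protein Length (525 aa).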


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTCCGC AATTCCTGTC TCTAACTCCC CTTCCTCGCA AAAACTGGCA CTCCACGAGC 
ATGTGTTTAG ACAGTTTCAT TAACGTAAAC GGTTGCTTTT TACTCTGGCG AGCGAAAGGA
GAAACACTGA TGAGCGCCAT AGATTCCCAA CTTCCCTCAT CTTCAGGGCA AGATCGCCCA
ACTGATGAGG TTGACCGCAT ATTATCACCA GGAAAGCTGA TCATACTCGG TCTGCAACAC
GTCCTTGTCA TGTACGCAGG TGCAGTCGCT GTTCCTCTTA TGATTGGTGA CCGACTGGGC
CTCTCAAAAG AAGCTATTGC GATGCTCATT AGCTCGGATC TCTTTTGCTG CGGGATCGTC
ACATTATTGC AATGTATCGG TATCGGCCGC TTTATGGGGA TCCGCCTGCC GGTGATTATG
TCGGTGACCT TCGCCGCTGT AACACCAATG ATAGCCATTG GGATGAACCC GGATATCGGC
CTGCTGGGGA TATTCGGTGC CACTATCGCC GCGGGTTTTA TCACCACATT ATTAGCGCCA
CTTATCGGTC GCTTGATGCC TTTATTCCCG CCACTGGTTA CCGGTGTGGT TATTACTTCT
ATCGGACTTA GCATCATTCA GGTGGGTATT GACTGGGCCG CAGGAGGTAA AGGGAATCCG
CAATATGGTA ATCCCGTTTA TTTAGGTATC TCCTTTGCCG TCTTAATTTT TATCTTGCTC
ATTACTCGCT ATGCGAAAGG ATTTATGTCC AACGTCGCCG TATTACTGGG GATTGTATTT
GGCTTTTTAC TTTCGTGGAT GATGAATGAA GTCAATTTAT CCGGGCTACA TGATGCTTCA
TGGTTTGCGA TTGTCACGCC GATGTCATTT GGTATGCCGA TTTTCGATCC CGTTTCCATT
CTGACCATGA CTGCCGTGTT AATCATCGTG TTTATCGAGT CGATGGGGAT GTTCCTGGCA
CTGGGTGAAA TAGTCGGTCG TAAACTCTCT TCACACGATA TTATTCGCGG GCTGCGTGTC
GATGGCGTAG GGACAATGAT AGGCGGAACG TTTAACAGCT TCCCCCACAC GTCATTTTCA
CAAAACGTTG GCCTGGTTAG CGTGACGCGC GTTCATAGCC GCTGGGTGTG TATTTCTTCG
GGAATTATAT TAATCCTGTT TGGCATGGTG CCAAAAATGG CGGTGCTGGT CGCCTCCATT
CCGCAATTTG TGCTGGGCGG TGCTGGGCTG GTGATGTTCG GCATGGTACT GGCGACAGGA
ATTCGAATTC TGTCGCGCTG TAACTACACC ACCAACCGTT ACAACCTCTA TATTGTGGCG
ATCAGTCTCG GCGTTGGCAT GACTCCGACG CTCTCTCACG ATTTCTTTTC TAAGTTACCG
GCCGTACTGC AACCGTTGCT GCATAGCGGC ATTATGCTCG CAACCCTTAG CGCCGTTGTG
CTGAACGTCT TCTTTAATGG CTATCAGCAT CATGCTGATC TGGTGAAGGA ATCCGTCTCT
GATAAAGATT TAAAAGTCAG GACAGTACGT ATGTGGCTTC TGATGCGCAA GCTGAAGAAA
AATGAGCATG GAGAATAA
 
Protein sequence
MPPQFLSLTP LPRKNWHSTS MCLDSFINVN GCFLLWRAKG ETLMSAIDSQ LPSSSGQDRP 
TDEVDRILSP GKLIILGLQH VLVMYAGAVA VPLMIGDRLG LSKEAIAMLI SSDLFCCGIV
TLLQCIGIGR FMGIRLPVIM SVTFAAVTPM IAIGMNPDIG LLGIFGATIA AGFITTLLAP
LIGRLMPLFP PLVTGVVITS IGLSIIQVGI DWAAGGKGNP QYGNPVYLGI SFAVLIFILL
ITRYAKGFMS NVAVLLGIVF GFLLSWMMNE VNLSGLHDAS WFAIVTPMSF GMPIFDPVSI
LTMTAVLIIV FIESMGMFLA LGEIVGRKLS SHDIIRGLRV DGVGTMIGGT FNSFPHTSFS
QNVGLVSVTR VHSRWVCISS GIILILFGMV PKMAVLVASI PQFVLGGAGL VMFGMVLATG
IRILSRCNYT TNRYNLYIVA ISLGVGMTPT LSHDFFSKLP AVLQPLLHSG IMLATLSAVV
LNVFFNGYQH HADLVKESVS DKDLKVRTVR MWLLMRKLKK NEHGE
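
The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is what translation table 11 (the bacterial, archaeal and plant plastid code) allows: TTG is one of its alternative start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the method the database itself uses; `translate` and the constant names are hypothetical, and the codon table is built from the standard NCBI amino acid string for table 11.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    If the first codon is a table 11 start codon it is read as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGCCTCCGC AATTCCTGTC TCTAACTCCC CTTCCTCGCA AAAACTGGCA CTCCACGAGC"
print(translate(first_line))
```

Applied to the first 60 bases, this yields the first 20 residues of the protein sequence listed above. The sketch does not handle ambiguous bases such as N.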