Gene SbBS512_E1140 details

Gene Information

Locus tag: SbBS512_E1140
Symbol: yegT
ID: 6269825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1031511
End bp: 1032788
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 50%
IMG OID: 641725269
Product: nucleoside transporter
Protein accession: YP_001879787
Protein GI: 187732934
COG category:
COG ID:
TIGRFAM ID: [TIGR00889] nucleoside transporter
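
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1031511
end_bp = 1032788

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1278 0 425
```

Under these assumptions the computed values agree with the listed Gene Length (1278 bp) and Protein Length (425 aa).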


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.102088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACAA CAGCAAAGCT GTCGTTCATG ATGTTTGTTG AATGGTTTAT CTGGGGCGCG 
TGGTTTGTGC CATTGTGGTT GTGGTTAAGT AAAAGCGGTT TTAGTGCCGG AGAAATTGGC
TGGTCGTATG CCTGTACCGC CATTGCGGCG ATCCTGTCGC CAATTCTGGT TGGCTCCATC
ACTGACCGCT TTTTCTCGGC GCAGAAAGTG CTGGCGGTAT TGATGTTCGC AGGCGCGCTG
CTGATGTATT TCGCTGCGCA ACAGACCACT TTTGCCGGGT TCTTCCCGTT ACTGCTGGCC
TACTCGCTAA CCTATATGCC GACCATTGCG CTGACTAACA GCATCGCTTT TGCCAACGTG
CCGGATGTTG AGCGTGATTT CCCGCGCATT CGTGTGATGG GCACTATCGG CTGGATTGCC
TCCGGTCTGG CATGTGGTTT CTTGCCGCAA ATACTGGGGT ATGCCGATAT CTCACCGACT
AACATCCCGC TGCTGATTAC CGCCGGAAGT TCTGCTCTGC TCGGTGTGTT TGCGTTTTTC
CTGCCCGACA CGCCACCAAA AAGCACCGGC AAAATGGATA TTAAAGTCAT GCTCGGCCTG
GATGCGCTGA TCCTGCTGCG CGATAAAAAC TTCCTCGTCT TTTTCTTCTG TTCATTCCTG
TTTGCGATGC CACTGGCGTT CTATTACATC TTTGCCAACG GTTATCTGAC CGAAGTTGGC
ATGAAAAACG CCACCGGCTG GATGACGCTC GGCCAGTTCT CTGAAATCTT CTTTATGCTG
GCATTGCCGT TTTTCACTAA ACGCTTTGGT ATCAAAAAGG TGTTATTGCT TGGTCTGGTC
ACCGCTGCGA TCCGCTATGG CTTCTTTATT TACGGTAGTG CGGATGAATA TTTCACCTAC
GCGTTACTGT TCCTCGGTAT TTTGCTTCAC GGCGTAAGTT ACGATTTTTA CTACGTTACC
GCTTACATCT ATGTCGATAA AAAAGCCCCC GTGCATATGC GTACCGCTGC GCAGGGGCTG
ATCACGCTCT GCTGCCAGGG CTTCGGCAGT TTGCTCGGCT ATCGTCTTGG CGGTGTGATG
ATGGAAAAGA TGTTCGCTTA TCAGGAACCG GTAAACGGAC TGACTTTCAA CTGGTCCGGG
ATGTGGACTT TCGGCGCGGT GATGATTGCC ATTATCGCCG TGCTGTTCAT GATTTTTTTC
CGCGAATCCG ACAACGAAAT TACGGCTATC AAGGTCGATG ATCGCGATAT TGCGTTGACA
CAAGGGGAAG TTAAATGA
 
Protein sequence
MKTTAKLSFM MFVEWFIWGA WFVPLWLWLS KSGFSAGEIG WSYACTAIAA ILSPILVGSI 
TDRFFSAQKV LAVLMFAGAL LMYFAAQQTT FAGFFPLLLA YSLTYMPTIA LTNSIAFANV
PDVERDFPRI RVMGTIGWIA SGLACGFLPQ ILGYADISPT NIPLLITAGS SALLGVFAFF
LPDTPPKSTG KMDIKVMLGL DALILLRDKN FLVFFFCSFL FAMPLAFYYI FANGYLTEVG
MKNATGWMTL GQFSEIFFML ALPFFTKRFG IKKVLLLGLV TAAIRYGFFI YGSADEYFTY
ALLFLGILLH GVSYDFYYVT AYIYVDKKAP VHMRTAAQGL ITLCCQGFGS LLGYRLGGVM
MEKMFAYQEP VNGLTFNWSG MWTFGAVMIA IIAVLFMIFF RESDNEITAI KVDDRDIALT
QGEVK
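
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they could be compared is shown below: it strips the block spacing, computes the G+C fraction, and translates the DNA codon by codon. The function names (`clean_sequence`, `gc_content`, `translate`) are hypothetical helpers written for this illustration, not part of any database tool. The codon table is the standard amino acid assignment, which NCBI translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Amino acid assignments in NCBI order (first, second, third base over T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove block spacing and line breaks, and uppercase the residues."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases (raises on an empty sequence)."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 18 bases of the gene sequence, copied from the record.
print(translate("ATGAAAACAA CAGCAAAG"))  # prints: MKTTAK
print(translate("CAAGGGGAAG TTAAATGA"))  # prints: QGEVK
```

The two fragments translate to the first six and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.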