Gene SbBS512_E2761 details

Gene Information

Locus tag: SbBS512_E2761
Symbol: nupC
ID: 6271857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2567743
End bp: 2568945
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 50%
IMG OID: 641726720
Product: nucleoside transporter NupC
Protein accession: YP_001881199
Protein GI: 187732844
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
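
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function and variable names are illustrative and do not come from the record.

```python
# Values copied from the record above.
start_bp = 2567743
end_bp = 2568945
gene_length_bp = 1203
protein_length_aa = 400


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under those assumptions both checks hold for this record: 2568945 - 2567743 + 1 = 1203 bp, and 1203 / 3 - 1 = 400 aa.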


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.246953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCGCG TCCTTCATTT TGTACTGGCA CTTGCCGTTG TTGCGATTCT CGCACTGCTG 
GTAAGCAGCG ACCGCAAAAA AATTCGTATC CGTTATGTTA TTCAACTGCT TGTTATCGAA
GTGTTACTGG CGTGGTTCTT CCTGAACTCC GACGTTGGTT TAGGCTTCGT GAAAGGCTTC
TCCGAAATGT TCGAAAAACT GCTCGGATTT GCCAACGAAG GGACTAACTT CGTCTTTGGT
AGCATGAATG ATCAAGGCCT GGCATTCTTC TTCCTGAAAG TGCTGTGCCC AATCGTCTTT
ATCTCTGCAC TGATCGGTAT TCTCCAGCAC ATTCGCGTGT TGCCGGTGAT CATCCGCGCA
ATTGGTTTCC TGCTCTCCAA AGTCAACGGC ATGGGCAAAC TGGAATCCTT TAACGCCGTC
AGCTCCCTGA TTCTGGGTCA GTCTGAAAAC TTTATTGCCT ATAAAGATAT CCTCGGCAAA
ATCTCCCGTA ATCGTATGTA CACCATGGCT GCCACGGCAA TGTCCACCGT GTCGATGTCC
ATCGTTGGTG CATACATGAC CATGCTGGAA CCGAAATACG TCGTTGCTGC GCTGGTATTG
AACATGTTCA GCACCTTTAT CGTGCTGTCG CTGATCAACC CTTACCGTGT TGATGCCAGT
GAAGAAAACA TTCAGATGTC CAACCTGCAC GAAGGTCAGA GCTTCTTCGA AATGCTGGGT
GAATACATTC TGGCAGGTTT CAAAGTTGCC ATTATCGTTG CCGCGATGCT GATCGGCTTT
ATCGCCCTGA TCGCTGCGCT GAACGCACTG TTTGCCACCG TGACTGGCTG GTTTGGCTAC
AGCATCTCCT TCCAGGGCAT TCTGGGCTAC ATCTTCTATC CGATTGCATG GGTGATGGGT
GTTCCTTCCA GTGAAGCACT GCAAGTGGGC AGTATCATGG CGACCAAACT GGTTTCCAAC
GAGTTCGTTG CGATGATGGA TCTGCAGAAA ATTGCTTCCA CGCTCTCTCC GCGTGCTGAA
GGCATCATCT CTGTGTTCCT GGTTTCCTTC GCTAACTTCT CTTCAATCGG GATTATCGCA
GGTGCAGTTA AAGGCCTGAA TGAAGAGCAA GGTAACGTGG TTTCTCGCTT CGGTCTGAAA
CTGGTTTACG GCTCTACCCT GGTGAGTGTG CTGTCTGCGT CAATCGCAGC ACTGGTGCTG
TAA
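
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence given as formatted text."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGACCGCG TCCTTCATTT TGTACTGGCA CTTGCCGTTG TTGCGATTCT CGCACTGCTG"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` would give a value to compare with the rounded GC content reported in the record.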
 
Protein sequence
MDRVLHFVLA LAVVAILALL VSSDRKKIRI RYVIQLLVIE VLLAWFFLNS DVGLGFVKGF 
SEMFEKLLGF ANEGTNFVFG SMNDQGLAFF FLKVLCPIVF ISALIGILQH IRVLPVIIRA
IGFLLSKVNG MGKLESFNAV SSLILGQSEN FIAYKDILGK ISRNRMYTMA ATAMSTVSMS
IVGAYMTMLE PKYVVAALVL NMFSTFIVLS LINPYRVDAS EENIQMSNLH EGQSFFEMLG
EYILAGFKVA IIVAAMLIGF IALIAALNAL FATVTGWFGY SISFQGILGY IFYPIAWVMG
VPSSEALQVG SIMATKLVSN EFVAMMDLQK IASTLSPRAE GIISVFLVSF ANFSSIGIIA
GAVKGLNEEQ GNVVSRFGLK LVYGSTLVSV LSASIAALVL
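
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, not in amino acid assignments). The `translate` helper is illustrative only, and it assumes the input is already in frame and on the coding strand.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGACCGCG TCCTTCATTT TGTACTGGCA"))
```

The first ten codons translate to MDRVLHFVLA, which matches the first block of the protein sequence above.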