Gene SAG2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2071 
Symbol 
ID1014882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2052834 
End bp2054036 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content36% 
IMG OID637317237 
ProductNa+ dependent nucleoside transporter 
Protein accessionNP_689057 
Protein GI22538206 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter
[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.626229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTTA TTTATAGTAT TATTGGTATT TTATTGGTAT TAGGAATTGT GTATGCAATT 
TCTTTCAATC GTAAGAGTGT TTCTCTAAGT TTAATTGGAA AAGCTCTTAT CGTTCAATTC
ATTATTGCGC TAATCTTAGT ACGTATCCCA CTAGGTCAAC AAGTTGTTAG TGTTGTTTCA
ACTGGAGTTA CTAAAGTAAT CAACTGTGGT CAAGCTGGAT TAAATTTTGT GTTTGGTTCA
TTAGCAGATA GTGGAGCAAA AACTGGTTTT ATTTTCGCCA TTCAAACGCT TGGCAATATT
GTTTTCTTAT CTGCCCTAGT TAGTCTACTT TATTATGTAG GAATCCTTGG ATTTGTAGTA
AAATGGATAG GTAAGGGCGT TGGTAAAATT ATGAAATCCT CAGAGGTTGA GAGTTTTGTT
GCCGTAGCTA ATATGTTTCT TGGTCAAACA GACAGTCCAA TTTTGGTTAG CAAATACCTA
GGTCGTATGA CTGATAGTGA GATAATGGTT GTGTTGGTAT CAGGTATGGG AAGTATGTCA
GTTTCTATTC TTGGTGGCTA CATTGCATTA GGCATTCCAA TGGAATATCT CTTGATTGCT
TCAACAATGG TTCCTATTGG CAGTATTCTC ATTGCTAAAA TCTTATTGCC TCAAACAGAA
CCTGTTCAAA AAATTGATGA CATTAAGATG GATAATAAAG GTAATAACGC CAATGTGATT
GATGCAATCG CTGAGGGTGC AAGCACAGGT GCACAAATGG CTTTCTCAAT TGGTGCTAGT
TTGATTGCCT TTGTTGGTTT AGTTTCTTTG ATTAATATGA TGTTAAGTGG ATTGGGAATC
CGCTTAGAAC AAATCTTCTC ATATGTTTTT GCTCCATTTG GTTTTCTTAT GGGATTTGAC
CACAAAAACA TTCTTCTAGA AGGAAACCTT CTTGGAAGTA AGTTGATTTT AAATGAGTTT
GTTTCGTTCC AACAATTGGG TGACCTAATC AAATCTTTAG ATTATCGTAC AGCATTGGTA
GCAACTATTT CACTTTGTGG TTTTGCTAAT TTATCAAGTT TAGGTATTTG TGTTTCAGGT
ATTGCTGTTC TTTGTCCAGA GAAACGTGGC ACCCTAGCTC GACTTGTTTT CCGTGCAATG
ATTGGTGGTA TTGCTGTAAG TATGCTTAGC GCCTTTATCG TCGGTATTGT AACTCTATTC
TAA
 
Protein sequence
MQFIYSIIGI LLVLGIVYAI SFNRKSVSLS LIGKALIVQF IIALILVRIP LGQQVVSVVS 
TGVTKVINCG QAGLNFVFGS LADSGAKTGF IFAIQTLGNI VFLSALVSLL YYVGILGFVV
KWIGKGVGKI MKSSEVESFV AVANMFLGQT DSPILVSKYL GRMTDSEIMV VLVSGMGSMS
VSILGGYIAL GIPMEYLLIA STMVPIGSIL IAKILLPQTE PVQKIDDIKM DNKGNNANVI
DAIAEGASTG AQMAFSIGAS LIAFVGLVSL INMMLSGLGI RLEQIFSYVF APFGFLMGFD
HKNILLEGNL LGSKLILNEF VSFQQLGDLI KSLDYRTALV ATISLCGFAN LSSLGICVSG
IAVLCPEKRG TLARLVFRAM IGGIAVSMLS AFIVGIVTLF