Gene BAS5085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5085 
Symbol 
ID2849207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4964330 
End bp4965541 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content37% 
IMG OID637508340 
ProductNupC family nucleoside transporter 
Protein accessionYP_031324 
Protein GI49188071 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value5.01622e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTTT TATGGGGAAT TGGCGGCGTG ATTGGAGTAT TAGCAATCGC TTTTTTACTA 
TCTTCCAACC GCAAAGCTAT TAATTGGCGC ACAATTTTAA TCGCGCTAGC ATTACAAATG
TCATTTTCAT TTATCGTATT ACGCTGGGAT GCCGGAAAAG CAGGTTTAAA ACACGCTGCA
GATGGCGTTC AAGGATTAAT TAATTTTTCT TACGAGGGAA TTAAGTTCGT TGCTGGGGAT
TTAGTCAACG CAAAAGGCCC TTGGGGATTT GTTTTCTTCA TTCAAGCACT ACTTCCAATC
GTATTTATTA GTTCATTAGT AGCAATCTTA TATCATTTCG GTATTATGCA AAGATTTGTT
AGTGTCGTTG GTGGCGCATT AAGTAAACTT CTTGGAACTT CTAAAGCAGA AAGTTTAAAC
TCAGTAACAA CTGTATTTTT AGGACAAACT GAAGCTCCAA TCTTAATCAA ACCTTACTTA
GCACGTTTAA CAAATAGTGA ATTCTTCGCT ATTATGGTAA GCGGTATGAC AGCTGTTGCT
GGATCAGTTC TTGTCGGTTA TGCAGCAATG GGTATTCCGT TAGAACACTT ATTAGCAGCA
GCAATTATGG CAGCTCCATC AAGTTTATTA ATTGCAAAAT TAATTATGCC AGAAACAGAA
AAAGTAGATA ATAACGTTGA ACTTTCTACA GAACGTGAAG ATGCAAACGT TATTGACGCT
GCGGCACGTG GTGCATCTGA AGGTATGCAA CTTGTTATTA ACGTAGCAGC AATGTTAATG
GCTTTTATCG CATTAATCGC TTTACTAAAC GGTTTATTAG GATTAATTGG CTCTCTGTTT
GATATTAAAC TTAGTCTTGA TTTAATCTTC GGTTATTTAC TATCACCATT TGCAATTTTA
ATCGGGGTTT CTCCTGGTGA AGCTGTACAA GCAGCAAGCT TTATCGGTCA AAAACTTGCA
ATCAACGAAT TCGTTGCATA CGCAAACTTA GGACCACACA TGGCAGAGTT CTCTGACAAA
ACAAATTTAA TTTTAACATT CGCAATCTGT GGATTCGCAA ACTTCTCTTC TATCGCAATT
CAATTAGGTG TAACAGGAAC ATTGGCTCCT ACTCGCCGTA AACAAATTGC ACAATTAGGG
ATTAAAGCAG TTATCGCTGG TACATTAGCA AACTTCTTAA ATGCAGCAGT TGCAGGTATG
ATGTTCCTAT AA
 
Protein sequence
MNLLWGIGGV IGVLAIAFLL SSNRKAINWR TILIALALQM SFSFIVLRWD AGKAGLKHAA 
DGVQGLINFS YEGIKFVAGD LVNAKGPWGF VFFIQALLPI VFISSLVAIL YHFGIMQRFV
SVVGGALSKL LGTSKAESLN SVTTVFLGQT EAPILIKPYL ARLTNSEFFA IMVSGMTAVA
GSVLVGYAAM GIPLEHLLAA AIMAAPSSLL IAKLIMPETE KVDNNVELST EREDANVIDA
AARGASEGMQ LVINVAAMLM AFIALIALLN GLLGLIGSLF DIKLSLDLIF GYLLSPFAIL
IGVSPGEAVQ AASFIGQKLA INEFVAYANL GPHMAEFSDK TNLILTFAIC GFANFSSIAI
QLGVTGTLAP TRRKQIAQLG IKAVIAGTLA NFLNAAVAGM MFL