Gene BCG9842_B5601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5601 
Symbol 
ID7183938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5102734 
End bp5103945 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content38% 
IMG OID643553125 
Productnucleoside transporter, NupC family 
Protein accessionYP_002448766 
Protein GI218900355 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000740053 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.13455e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTTT TATGGGGAAT TGGCGGCGTG ATTGGAGTAT TAGCAATTGC TTTCTTACTA 
TCTTCCAACC GCAAAGCTAT TAATTGGCGC ACAATTTTAA TTGCGCTAGC ATTACAAATG
TCATTTTCAT TTATCGTATT ACGATGGGAT GCTGGTAAAG CAGGTTTAAA ACACGCTGCT
GACGGTGTTC AAGGATTAAT TAATTTTTCT TACGAGGGAA TTAAGTTCGT TGCTGGGGAT
TTAGTCAACG CAAAAGGACC TTGGGGATTT GTATTCTTTA TTCAAGCACT ACTTCCAATC
GTATTTATTA GTTCATTAGT AGCAATCTTA TATCATTTCG GTATTATGCA GAAATTTGTT
AGCGTCGTTG GTGGTGCATT AAGTAAACTT CTTGGAACTT CTAAAGCAGA AAGCTTAAAC
TCAGTAACGA CTGTATTTTT AGGACAAACT GAAGCTCCAA TCTTAATTAA ACCTTACTTA
GCACGCTTAA CAAATAGTGA ATTCTTCACT ATTATGGTAA GCGGTATGAC AGCTGTTGCC
GGATCAGTTC TTGTCGGCTA TGCAGCAATG GGTATTCCGT TAGAGCACTT ATTAGCAGCT
GCAATTATGG CAGCTCCATC AAGCTTATTA ATTGCGAAAC TAATCATGCC AGAGACAGAA
AAAGTAGATA ATAACGTTGA ACTTTCTACA GAACGTGAAG ACGCAAACGT TATCGACGCA
GCTGCACGTG GTGCATCTGA AGGTATGCAA CTTGTTATTA ACGTAGCAGC AATGTTAATG
GCTTTCATTG CATTAATCGC TTTATTAAAT GGTCTATTAG GATTAGTTGG GTCTTTATTC
CATATTAAAC TTAGCCTTGA TTTAATCTTC GGTTACTTAT TATCACCATT TGCAATCTTA
ATCGGGGTTT CTCCAGGTGA AGCTGTACAA GCAGCAAGCT TTATCGGTCA AAAACTTGCA
ATCAACGAAT TCGTTGCATA CGCAAACTTA GGACCACATA TGGCAGAGTT CTCTGACAAA
ACAAATCTAA TTTTAACATT CGCAATCTGT GGATTCGCAA ACTTCTCTTC TATTGCAATT
CAATTAGGTG TAACAGGAAC GCTAGCTCCT ACTCGCCGTA AACAAATTGC ACAATTAGGG
ATTAAAGCAG TTATCGCTGG TACATTAGCT AACTTCTTAA ATGCAGCAGT TGCAGGTATG
ATGTTCCTAT AA
 
Protein sequence
MNLLWGIGGV IGVLAIAFLL SSNRKAINWR TILIALALQM SFSFIVLRWD AGKAGLKHAA 
DGVQGLINFS YEGIKFVAGD LVNAKGPWGF VFFIQALLPI VFISSLVAIL YHFGIMQKFV
SVVGGALSKL LGTSKAESLN SVTTVFLGQT EAPILIKPYL ARLTNSEFFT IMVSGMTAVA
GSVLVGYAAM GIPLEHLLAA AIMAAPSSLL IAKLIMPETE KVDNNVELST EREDANVIDA
AARGASEGMQ LVINVAAMLM AFIALIALLN GLLGLVGSLF HIKLSLDLIF GYLLSPFAIL
IGVSPGEAVQ AASFIGQKLA INEFVAYANL GPHMAEFSDK TNLILTFAIC GFANFSSIAI
QLGVTGTLAP TRRKQIAQLG IKAVIAGTLA NFLNAAVAGM MFL