Gene GWCH70_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1506 
Symbol 
ID7976593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1578298 
End bp1579257 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content48% 
IMG OID644798403 
ProductBile acid:sodium symporter 
Protein accessionYP_002949576 
Protein GI239826952 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID[TIGR00841] bile acid transporter 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.221367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATAT TAGCGAAATT TAGCAACTTC GTTGGAAACA CCTTTGCTGT CTGGGTGCTT 
CTTTTTGCTG CTCTTGCTTT TTATGATCCA AACGCTTTTA CATGGATCGC TCCATACATA
GTCCCGCTGC TTGGCGTTGT CATGTTTGGC ATGGGTCTTA CCTTATCACC CAATGATTTT
AAAGAAGTAT TCAAACGCCC GGTAGAGGTG CTGATCGGTG TAGCCGCCCA ATTTCTCATT
ATGCCGCTTG TCGCTTTTTT GCTTGCTCGT TATTTGCCGG TTTCGCGGGA AGTCGCTCTT
GGCATTCTTC TCGTCGGCTG TTGTCCGGGT GGAACAGCAT CTAACGTGAT GACCTATTTG
GCCAAAGGAG ATACCGCGCT GTCGGTCGCC GTGACATCCG TTTCTACGAT ACTGGCGCCG
ATTTTGACGC CGTCGCTGAT GTTGCTGCTT GCCGGAAAAT GGCTGTCCGT TTCTGCCGCG
GCATTGTTTT GGTCAATTGT AAAAGTTGTC CTAATTCCGA TTATTTTTGG GTTGATTGTG
CAAGCCCTTT TCCAAAAGCA AGTGAAAGCG TTCATCCCAG TCCTGCCGCT CGTTTCTGTC
ATTGCGATTG TCGCCATTGT CGCGGCTGTC GTTGGGCAAA ACCAGCAGGC AATTGCAAAA
AGCGGGCTTG CCATCTTCTT AATCGTCGTC ATTCATAACG GGCTTGGATT ATTGCTTGGC
TATTGGTTTG CAAAATTATT TCGTTTGTCA GCGCCAAAGC AAAAAGCGAT TTCCATTGAA
GTCGGCATGC AAAATTCCGG GCTTGGCGCG GCGTTAGCGA CCGCTCATTT TTCACCGCTT
GCCGCTGTTC CAAGCGCTAT TTTCAGCGTC TGGCATAACA TTTCCGGTCC GATTGCCGCA
ACCTATTTCC GTAAAAAAAG CGAGAAGGAA GAAGCGGAAC AATCGACGTT TTCGGCATAA
 
Protein sequence
MGILAKFSNF VGNTFAVWVL LFAALAFYDP NAFTWIAPYI VPLLGVVMFG MGLTLSPNDF 
KEVFKRPVEV LIGVAAQFLI MPLVAFLLAR YLPVSREVAL GILLVGCCPG GTASNVMTYL
AKGDTALSVA VTSVSTILAP ILTPSLMLLL AGKWLSVSAA ALFWSIVKVV LIPIIFGLIV
QALFQKQVKA FIPVLPLVSV IAIVAIVAAV VGQNQQAIAK SGLAIFLIVV IHNGLGLLLG
YWFAKLFRLS APKQKAISIE VGMQNSGLGA ALATAHFSPL AAVPSAIFSV WHNISGPIAA
TYFRKKSEKE EAEQSTFSA