Gene GWCH70_3386 details


Gene Information

Locus tag: GWCH70_3386
Symbol:
ID: 7976167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3412664
End bp: 3414118
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 46%
IMG OID: 644800151
Product: sodium/proline symporter
Protein accession: YP_002951290
Protein GI: 239828666
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
            [TIGR02121] sodium/proline symporter
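
The length fields in this record are consistent with its coordinates, which a few lines of arithmetic can confirm. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3412664
end_bp = 3414118

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1455 0 484
```

The result matches the reported Gene Length (1455 bp) and Protein Length (484 aa).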


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTATTGA TTTCTGTCGC AGTGTATATG ATTGGGATGC TTCTTATTGG GTATTGGGCT 
TATAAACGTA CGTCCAACCT TTCCGATTAT ATGCTTGGGG GAAGGACGTT AGGCCCTGCG
GTCACAGCGC TCAGTGCGGG GGCTTCCGAT ATGAGCGGTT GGCTGTTGAT GGGGCTTCCA
GGAGCAATGT ATGCGCAAGG ATTAAGTGCA TCATGGATTG TTATTGGGCT TACGCTCGGA
GCATATGCGA ATTGGTTATA TGTTGCGCCT CGTTTGCGTG TATATACGGA AGTAGCAAAT
GATTCCATTA CCATCCCGGA ATTTTTAGAA AATAGATTTG GCGATACATC GAAGCTGCTT
CGGTTAATTT CCGGTCTTGT TATTATGATT TTCTTTACCT TTTATGTATC TTCTGGTCTT
GTATCTGGAG CGGTGCTGTT TCAAAACTCA TTTGGCGCAA GTTATCATAC AGGATTATGG
ATTGTTGCCG GCGTCGTTGT GGCGTATACA TTGTTTGGAG GATTTTTGGC TGTTAGTTGG
ACCGATTTTG TGCAAGGAAC GATTATGTTT ATTGCTCTTA TTCTTGTCCC GGCCGTAACG
CTTTTCCATA CGGGCGGTGT CGGCGATACG TTTACTACCA TTAAAAACAT TGATCCTAAT
TTGCTCGATT TATGGAAAGG AACTAGCTTC CTCGGTATTA TTTCGCTGTT TGCGTGGGGG
CTTGGCTATT TCGGGCAGCC GCACATTATT GTCCGCTTTA TGGCGATTTC GTCGGTCAAG
GAAATGAAAA GCGCCCGCCG CATCGGAATG GGGTGGATGA TTTTCTCCGT TGTCGGCGCG
ATGTTGACAG GGCTGTTTGG AATCGCTTAC TTTTCACAGC ACGGCACTAA GCTCGATGAT
CCGGAGACCG TATTTATCAA GCTTGGAGAA ATTTTATTCC ATCCGCTCAT CACCGGATTT
TTGCTTGCGG CGATTTTAGC GGCCATTATG AGTACGATTT CTTCGCAGCT TCTTGTTACG
TCCAGTTCCT TAACAGAGGA TTTATATAAA GTGGTATTCC GTCGTTCCGC TTCGGATAAA
GAGTTGATTT TCGTCGGCCG TCTTTCCGTA TTAATTGTAG CGTTAGTAGC GTCCGCGTTC
GCGTACACGA AAAACGATAC GATTTTAAAC TTGGTCGGTT ATGCGTGGGC AGGATTCGGT
GCTTCGTTTG GTCCAGTCAT TTTATTAAGC CTGTTCTGGC GCCGCATGAC GAAATGGGGG
GCGTTTGCCG GCATGGTCGC AGGAGCGATG ACCGTGATTC TTTGGACACA ATCGGAATAT
TTGAAAAACT TGCTGTATGA GATGATTCCA GGTTTTGCAG CAAGCTTGGC CGCGATTGTT
GTCGTTAGCT TGTTGACAAA AGCGCCGGAA GAAAAAGTTG TCGAGCAGTT TGACAAATTT
AAAGCATCGT TATAA
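
The GC content field in the gene record can be recomputed from the gene sequence. The sketch below is one plausible way to do this in Python, assuming GC content means the share of G and C bases among all bases and that the spaces and line breaks in the listing are purely cosmetic. The function name `gc_percent` is hypothetical, and only the first row of the sequence is used here for brevity.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100 * gc_count / len(bases)


# First row of the gene sequence, copied from the listing.
first_row = "TTGGTATTGA TTTCTGTCGC AGTGTATATG ATTGGGATGC TTCTTATTGG GTATTGGGCT"
print(round(gc_percent(first_row), 1))
```

Passing the full 1455 bp sequence to the same function gives a value that can be compared with the GC content reported in the record.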
 
Protein sequence
MVLISVAVYM IGMLLIGYWA YKRTSNLSDY MLGGRTLGPA VTALSAGASD MSGWLLMGLP 
GAMYAQGLSA SWIVIGLTLG AYANWLYVAP RLRVYTEVAN DSITIPEFLE NRFGDTSKLL
RLISGLVIMI FFTFYVSSGL VSGAVLFQNS FGASYHTGLW IVAGVVVAYT LFGGFLAVSW
TDFVQGTIMF IALILVPAVT LFHTGGVGDT FTTIKNIDPN LLDLWKGTSF LGIISLFAWG
LGYFGQPHII VRFMAISSVK EMKSARRIGM GWMIFSVVGA MLTGLFGIAY FSQHGTKLDD
PETVFIKLGE ILFHPLITGF LLAAILAAIM STISSQLLVT SSSLTEDLYK VVFRRSASDK
ELIFVGRLSV LIVALVASAF AYTKNDTILN LVGYAWAGFG ASFGPVILLS LFWRRMTKWG
AFAGMVAGAM TVILWTQSEY LKNLLYEMIP GFAASLAAIV VVSLLTKAPE EKVVEQFDKF
KASL
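
The protein sequence can be related back to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). Note that the gene starts with TTG rather than ATG: under table 11, TTG is an accepted initiation codon and is read as methionine at the first position, while the same codon inside the gene codes for leucine. The following is a minimal Python sketch of that rule, not the database's own pipeline. The names `translate_cds` and `first_is_start` are hypothetical, and the codon table is built from the compact NCBI-style amino acid string.

```python
from itertools import product

# Codons in T, C, A, G order paired with the table 11 amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate_cds("TTGGTATTGA TTTCTGTCGC AGTGTATATG"))        # MVLISVAVYM
print(translate_cds("AAAGCATCGT TATAA", first_is_start=False))  # KASL*
```

The two fragments reproduce the first ten residues (MVLISVAVYM) and the last four residues (KASL) of the protein sequence, followed by the TAA stop codon.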