Gene GWCH70_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3173 
Symbol 
ID7977026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3201932 
End bp3203404 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content48% 
IMG OID644799958 
ProductNa+/solute symporter 
Protein accessionYP_002951097 
Protein GI239828473 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCGG CATTGGTGAT TATTTTCGCT TTCCTGTTGC TTTCCCTCTA TTTAGGGGTG 
CAAGCGCGCA AAGGAAAAGA TATGAACTTG GAACAGTGGA CGGTCGGCGG CCGCGGGTTT
GGCACGATCT TTGTCTTTCT CCTCCTGGCC GGCGAAATTT ATACGACATT TACCTTTTTA
GGCGGAAGCG GTTGGGCTTA CGGCAAAGGG GGGCCGACGT TTTACATTAT CGCGTACGGC
TGCTTGGCGT ATGTGCTGTC ATATTGGATG CTTCCAAAAG TATGGAAATA TGCGAAAGAG
CATCAGCTGA TGTCCCAATC TGACTTTTTT GTTAGCAAAT ACAACAGTCC GCTGTTAGGG
GTGCTCGTTT CCCTAGTCGG GGTGGTCGCG CTTATTCCGT ATTTGGTGCT GCAGCTAAAA
GGATTGGGTA TTATCGTTTC TCAAGCTTCA TACGGCACGA TTTCGTCGAC GGCGGCGATA
TGGATCGGCG TTCTTTCCGT TACAGTGTAC GTGATGATTT CGGGCATCCA CGGCTCGGCG
TGGACAGCGG TAGTGAAAGA TATTATGATT TTAGTTGTCG CGGTCTTTTT AGGTCTTTAT
CTTCCATTCC ATTATTACGG CGGAATTCAG CCGATGTTTG AAGCAATTGA ACAAGCAAAA
CCAGGGTTTC TCGTATTGCC GGACAAAGGC ATGAGCGTGT CTTGGTTTAT TTCCACCGTG
CTGTTAACGG TGCTTGGTTT TTATATGTGG CCGCATACGT TCGGTTCGAT TTATTCGGCG
AAAAGCGCCA ACGTTTTTCG GAAAAATGCG ATGATTTTGC CTCTTTATCA GTTAGTGCTG
CTGTTTGTCT TTTTTGTCGG CTTTGCAGCG ATTTTGCAAA TTCCGCATTT AGAAGGCTCC
GATGCTGATC TGGCTTTATT GCGGTTGTCC ATTCAAACGT TTGACCCTTG GGTCGTCGGG
CTGATCGGTG CAGCGGGGCT GTTGACGGCA ATGGTTCCGG GTTCGATGAT TTTAATGACA
GCATCGACGT TGCTTGCGAA AAACGTCTAT AAAGTGTTTT CTCCTTCCGC TACCGACGAT
CAGGTCACGA AACTAGCGAA ATATCTCGTC CCTGTCATTG CGTTAGTTTC GTTATATTTC
ACGTTCCGCG GCGGAAATAC GATCGTTGCA CTGCTTCTTA TGGGATATAG CCTCGTGACG
CAGTTGTTCC CATCGTTTGT GCTCAGCCTG ATGAAAAACA ATTTTGTGAC GAAACAAGGA
GCGTTTGCCG GTATTATCGC TGGTGTTGCG ACAGTTGCCT ATATTACATT ATCGGGCAGC
AGCATCGGCA CGTTGTTCCC TTCTCTGCCG CAGGTCGTGC AAGATCTGAA TGTCGGAATT
ATCGCATTGA TTGTGAATAT CGTTGTCACG GTGGTCGTAA GCTGGATTCC GACCCGATCT
GTCAGCGTCG ACACAGAAAA AAGCGTGATG TAG
 
Protein sequence
MNAALVIIFA FLLLSLYLGV QARKGKDMNL EQWTVGGRGF GTIFVFLLLA GEIYTTFTFL 
GGSGWAYGKG GPTFYIIAYG CLAYVLSYWM LPKVWKYAKE HQLMSQSDFF VSKYNSPLLG
VLVSLVGVVA LIPYLVLQLK GLGIIVSQAS YGTISSTAAI WIGVLSVTVY VMISGIHGSA
WTAVVKDIMI LVVAVFLGLY LPFHYYGGIQ PMFEAIEQAK PGFLVLPDKG MSVSWFISTV
LLTVLGFYMW PHTFGSIYSA KSANVFRKNA MILPLYQLVL LFVFFVGFAA ILQIPHLEGS
DADLALLRLS IQTFDPWVVG LIGAAGLLTA MVPGSMILMT ASTLLAKNVY KVFSPSATDD
QVTKLAKYLV PVIALVSLYF TFRGGNTIVA LLLMGYSLVT QLFPSFVLSL MKNNFVTKQG
AFAGIIAGVA TVAYITLSGS SIGTLFPSLP QVVQDLNVGI IALIVNIVVT VVVSWIPTRS
VSVDTEKSVM