Gene GWCH70_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0467 
Symbol 
ID7978618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp519517 
End bp520791 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content43% 
IMG OID644797444 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002948644 
Protein GI239826020 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000451235 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATC GTTGGTTCAT CGCTTTATCA GCAGTAGGAA TTCATATTTG CATCGGTTCT 
GTTTATGCAT GGAGTAATTT TACGAATCCA TTAAAACAAA TGTTTGGGTG GTCGGATCAA
GAAGTTGCGC TAACATTTAG CATTGCCATT TTGTTTTTAG GTTTATCAGC GGCATTCCTT
GGTCACTTTG TCGAAAAACA CGGTCCTAGA AAAGCAGGAT TACTTGCTGC AACGTTCTTT
GGGATCGGCA TTGCTGGTTC CGGATTTGCG GTTGTTTTGG AATCGAAGCA TTTGCTTTAC
CTATTCTATG GGGTGTTAGG CGGAATTGGT CTTGGAGTAG GATATATTAC ACCTGTTTCT
ACGCTGGTGA AATGGTTCCC AGATCGTCGT GGTTTTGCTA CGGGGCTTGC GATTATGGGA
TTCGGATTTG CCGCCGCCAT TTCAAGTCCT GTGATGAACA GTTTGATCGG TTCGGTTGGC
GTTAGTAACA CGTTTTATAT TTTAGGTGCT GTTTACTTTT TGATTATGGC TTTCTCTTCT
CTTTACTTAG AGAAGCCGCC TGAAGGCTGG ATGCCTGAAG GCTTTAAAGA AAAAGTGAAG
GCTGGAAAAG CAAAACCGTT GATGGATTTA TCACAATTGA CCGCAAATGA AGCAATTAAA
ACAAGACGTT TTTGGTATTT ATGGATGATG TTGTTTATCA ATGTAACATG CGGCATTGCT
ATCTTAGCTG TGGCAAAGCC GCTTGCCATG GAAAGTATCG GTATTGACCA AGCTGCGGCT
GCGGCATTGG TTGGAGCCAT TGGAGTATTT AACGGCTTAG GGCGTATCGG CTGGGCATCT
GCCTCTGATT ACATCGGAAG ACCGAATACC TATACAGCGT TCTTCGTTTT ACAAATTATC
ATCTTCTTTT TCTTGCCGGA CGTTTCGGTC AAGTGGTTGT TTATGGGAAT GTTGATTATC
GTGTACACTT GCTACGGCGG AGGATTTGCG TGCATTCCTG CGTATATCGG AGACTTGTTT
GGTACAAAGC AATTAGGTGC CATTCATGGT TACATTTTGA CCGCTTGGGC AGCCGCAGGA
CTTGTAGGCC CGCTATTTGC TGCTTACATC AAGGATACTA CCGGTTCTTA TGAAGGCAGC
TTGACCTTCT TCGCAGGACT ATTTGTTATC GCTTTGGCTG TTTCTCTGCT GGTGCGCATT
GATATCCGTC AATTACGAGA GAAAAATGCT CAATCTATAT ATCCGTCAAT TACAGAAGAA
AAAAGTACAA TCTAA
 
Protein sequence
MKNRWFIALS AVGIHICIGS VYAWSNFTNP LKQMFGWSDQ EVALTFSIAI LFLGLSAAFL 
GHFVEKHGPR KAGLLAATFF GIGIAGSGFA VVLESKHLLY LFYGVLGGIG LGVGYITPVS
TLVKWFPDRR GFATGLAIMG FGFAAAISSP VMNSLIGSVG VSNTFYILGA VYFLIMAFSS
LYLEKPPEGW MPEGFKEKVK AGKAKPLMDL SQLTANEAIK TRRFWYLWMM LFINVTCGIA
ILAVAKPLAM ESIGIDQAAA AALVGAIGVF NGLGRIGWAS ASDYIGRPNT YTAFFVLQII
IFFFLPDVSV KWLFMGMLII VYTCYGGGFA CIPAYIGDLF GTKQLGAIHG YILTAWAAAG
LVGPLFAAYI KDTTGSYEGS LTFFAGLFVI ALAVSLLVRI DIRQLREKNA QSIYPSITEE
KSTI