Gene Gobs_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3044 
Symbol 
ID8754720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp3190704 
End bp3192089 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003410025 
Protein GI284991471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0440547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCCC TCACCCCCAC CACCCGACAG CCGACCGCGC CGGTCACCGC CGCGCCGCGC 
CGCGGTCGCT GGATCGACGA CTGGCGCCCG GAGGACCCCG CCTTCTGGGA GTCCGCGGGC
AAGCCGGTCG CCCGCCGCAA CCTGTTCTTC TCGGTCTTCT CCGAGCACAT CGGCTTCTCG
ATCTGGAGCC TGTGGTCGGT GCTCGTGCTG TTCCTGCCCG AGGCCGTGTA CGGCATCGAC
CCGGCCGGGA AGTTCCTGCT GACCACGCTG CCGACGGCGC TGGGCGCATT CGTCCGGCTG
CCCTACACCT TCGCCGTCGC GAGGTTCGGC GGCCGCAACT GGACGATCGT CAGCGCCGCG
CTGCTGCTGG TGCCCACGAT CGCCACGGCC GTCGTCCTGG AGCCCGGCGT CGACTACACG
ACCCTGCTCG TCGTCAGCTG CCTCGCCGGT GTCGGTGGCG GCAACTTCGC CAGCTCCATG
GCCAACATCA ACGCCTTCTA CCCGAACCGG CTCAAGGGCT GGGCGCTCGG CCTCAACGCC
GGCGGCGGCA ACCTCGGCGT CCCCGTCGTC CAGCTGGTGG GCCTGCTCGT GCTGGCCACC
GCGGGCGCCG AGCACCCGCG GCTGGTCCTG CTGGTCTACA TCCCGCTGAT CGCCGTCGCC
GCCGTCGGTG CGGCCCTGCT CATGGACAAC CTGACGACGG CGCGCAACCA GCCGCGGGCC
ATGCGCGAGG CCACCCGCGA GCCGCACACC TGGATCATGT CCTTCCTCTA CATCGGCACC
TTTGGCTCGT TCATCGGCTT CGGGTTCGCC TTCGGCCAGG TGCTGCAGAA CCAGTTCACC
GGCGACTTCG CCACGCCCCT CGCCGCCGCG TCGCTGACCT GGCTCGGCCC GCTGCTGGGC
TCGCTGATCC GCCCGCTCGG CGGCTCGCTG GCCGACCGCT TCGGCGGTGC CCGCATCACG
TTCTGGAACT TCGCGGCGAT GGCCGTCGGC GCCGGGATCG TGTGGAGCGC CAGCCAGGTG
GGGTCGCTGC CGCTGTTCGT CGTCGGCTTC GTGTCGCTGT TCGTGTTCAG CGGTCTCGGC
AACGGCTCGA CCTACAAGAT GATCCCGGCG ATCTTCCGCA CCCAGGCGCA GCAGCGGGTG
GCCGCCGGGG AGGACGGCGC CGTCGCCGAC CGGCACGCGC TGCGCATGTC CGGCGCGCTC
ATCGGCATCG CCGGCGCGGT CGGCGCCTTC GGCGGCGTGC TGGTCAACCT GGCCTTCCGC
CAGTCGTTCC TGGCCACCGG CACCGGCGAC TCGGCCTATC TGGTGTTCAT CGCCTTCTAC
CTGGTCTGCC TCGCCGTCAC GTGGGCGGTC TACCTGCGGC CCCGGGCGCC CATGTCCGGG
GTGTGA
 
Protein sequence
MAALTPTTRQ PTAPVTAAPR RGRWIDDWRP EDPAFWESAG KPVARRNLFF SVFSEHIGFS 
IWSLWSVLVL FLPEAVYGID PAGKFLLTTL PTALGAFVRL PYTFAVARFG GRNWTIVSAA
LLLVPTIATA VVLEPGVDYT TLLVVSCLAG VGGGNFASSM ANINAFYPNR LKGWALGLNA
GGGNLGVPVV QLVGLLVLAT AGAEHPRLVL LVYIPLIAVA AVGAALLMDN LTTARNQPRA
MREATREPHT WIMSFLYIGT FGSFIGFGFA FGQVLQNQFT GDFATPLAAA SLTWLGPLLG
SLIRPLGGSL ADRFGGARIT FWNFAAMAVG AGIVWSASQV GSLPLFVVGF VSLFVFSGLG
NGSTYKMIPA IFRTQAQQRV AAGEDGAVAD RHALRMSGAL IGIAGAVGAF GGVLVNLAFR
QSFLATGTGD SAYLVFIAFY LVCLAVTWAV YLRPRAPMSG V