Gene Gbro_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbro_4237 
Symbol 
ID8553618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGordonia bronchialis DSM 43247 
KingdomBacteria 
Replicon accessionNC_013441 
Strand
Start bp4533988 
End bp4535256 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content65% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003275277 
Protein GI262204069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGCC TCTACCGCAG TTTCCGCAGC TTCGATCGCC CCAGCCAAGT GCTGATGGTC 
AATCAGTTCA CCATCAATCT CGGCTTCTAC ATGCTCATGC CCTACCTCGC CGCCTACCTC
GCCGGCCCGC TGGCCCTGGC CGCATGGGCA GTGGGCCTGG TCCTGGGGAT ACGCAACTTC
TCCCAACAGG GCATGTTCCT GATCGGGGGA ACACTCGCTG ACCGCTGGGG TTACAAACCG
CTCATCATCG CCGGATGCCT GATGAGGGTC ATCGGATTCG TCCTACTGGC CCTGGTCAGT
TCGCTGCCGG CGATCCTGAT CGCCTCGGCG GCAACGGGAT TCGCCGGGGC GCTGTTCAAC
CCCGCCGTAC GGGCCTACCT GGCCGGCGAC GCCGGTGAGC GCCGCATCGA AGCATTCGCC
GTGTTCAACG TGTTCTACCA GGCCGGGATC CTGCTCGGGC CGCTGGTCGG ACTAGCCCTG
ACCGCCCTGG ACTTCCGGCT CACCTCTCTC TGTGCGGCAG CAGTTTTCGC GGTCCTGACC
CTCGTGCAGA TCAAAGCGCT ACCCGCAACA ACACCTGCCC CGAGCAGCAC CTCAATCATT
CAGGACTGGC GCAGCGTCGT CCGTAACCGG CGATTCCTGC TCTTCGCTGC GGCGATGGCC
AGCTCTTACG TGCTGTCCTT CCAGATCTAC CTCGCCCTGC CCCTACATGC CGACCGCATA
GCAGACAACC CCACCCTCGC CACCAGTGTT GTCACCGCAA TGTTCGTGGT GACCGGTCTG
GTGGCCATCG CCGGCCAGCT ACGCATCACC ACATGGTTCG GCAACCGCTG GGGCAGCACC
GGCAGCCTGA GCGTGGGAAT GACGCTGATG GCTGCAGCTT TCCTGCCCCT AGTCGCCAGC
ACCGCAACGC ATCAGCGCAA CATCGCCCTC AACATCGCCG CCCTGCTACT CACCGCAGCT
CTGCTCGCCG CCGCCACCGC AGCAGTGTTC CCATTCGAAA TGGACACGGT GGTGGGACTG
GCCCGCGGCA CGCTTGTCGC CACCCACTAC GGGCTCTACA ACACCATCGT CGGGATCGGG
ATTCTGCTCG GCAACGCCGC GACCGGGTGG CTGTTCAGCG CGGCCACCAC CCGCGACATG
CCCGAACTGG TGTGGATCGC TCTAGTCCTC ACCGGCATGG CGGCGGCGTC TGCCCTGTTG
CTGTTGCACC GCCGCGGATG GCTCAGTGTC GCCGCCCCCA CCGAAGCTCA GCCCACCACA
ACCCATTGA
 
Protein sequence
MRRLYRSFRS FDRPSQVLMV NQFTINLGFY MLMPYLAAYL AGPLALAAWA VGLVLGIRNF 
SQQGMFLIGG TLADRWGYKP LIIAGCLMRV IGFVLLALVS SLPAILIASA ATGFAGALFN
PAVRAYLAGD AGERRIEAFA VFNVFYQAGI LLGPLVGLAL TALDFRLTSL CAAAVFAVLT
LVQIKALPAT TPAPSSTSII QDWRSVVRNR RFLLFAAAMA SSYVLSFQIY LALPLHADRI
ADNPTLATSV VTAMFVVTGL VAIAGQLRIT TWFGNRWGST GSLSVGMTLM AAAFLPLVAS
TATHQRNIAL NIAALLLTAA LLAAATAAVF PFEMDTVVGL ARGTLVATHY GLYNTIVGIG
ILLGNAATGW LFSAATTRDM PELVWIALVL TGMAAASALL LLHRRGWLSV AAPTEAQPTT
TH