Gene Gbro_3201 details

Gene Information

Locus tag: Gbro_3201
Symbol:
ID: 8552574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gordonia bronchialis DSM 43247
Kingdom: Bacteria
Replicon accession: NC_013441
Strand:
Start bp: 3429519
End bp: 3431165
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: DAK2 domain fusion protein YloV
Protein accession: YP_003274299
Protein GI: 262203091
COG category:
COG ID:
TIGRFAM ID:
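
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3429519
end_bp = 3431165
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# A CDS of N nucleotides holds N / 3 codons; assuming the last codon
# is a stop codon, the protein has one residue fewer than that.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: the span is 1647 bp, which corresponds to 549 codons, or 548 residues plus one stop codon.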


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCGCCC GGACCATGAA CCCACAGGTG TTGCGCGACT GGGCACGCGC ATCTGCGGAG 
AACCTCGAGG CACTGCGCGG CGAGATCAAC GACCTCAACG TCTTCCCGAT TCCCGACTCC
GACACCGGTA GCAACATGGT CTTCACCATG ACGGCGGCGG CCGACGCAAC CGACGCCCTT
CCGCAGGACG CCGGCGTGGC CGACGTCGCA CGCGCGATGG CCGACGGTGC GGTCGCCGGA
GCGCGTGGCA ATTCCGGGAT CATCCTCTCG CAGGTCCTGG TGGGACTCGC GGACGCGGCC
GAACTGCTCG ACGGTCCGGA GCTGACCTTC AAGAATCTGG TGGCCTCGGG CCTGCGGCTC
GGATCGCTGG CGGCCACCCG GGCGGTCAGC GAACCGCAGG AGGGCACGGT GCTCACCCTG
ATCAAGGTCG CGGCCACGGC GGCGGCCGAC CACGAATCAG ACACCGCCAG CGACCTCGCC
CGCGCGATCG CCGATGAGTG TGCCGACGCG CTCGAACGCA CCCCCGATCA ACTCCCGGTG
CTCGCCAGTG CCGGCGTCGT CGATGCCGGC GGGCGTGGTT TCCTCGCCCT GCTCGACGCG
ATGGTCTCGG TGCTGACCGG GGTGTCCAAT CGGCGACGCC GCTACCGCGG ATTCCTCACC
GGTGGTGGTC AGGCGGGGCA TCCGGAGGGC GAGACCTGCT CCGACGGCAG CGACATGGAC
TTCGAGGTCA TGTATCAGTT GCACGGCGCG GTCGGTGAGC GGATCGCCGA CCTGCGGCGC
TTCCTCGACG ATGTCGGGGA CGCGGTGGTC ATCGTCGGCG ACAGCTCCAG CGGTGACGGA
GAACGGTTCT CGGTTCACGT GCATACCTGT GATCCGGGTG CGGCGGTGGA GGCCGGGGTC
ACGCTCGGGC GGGTCTCGGA CATCCGGATC AGCTGCTTTG CGCTCGATGC CATTCGCGCA
CACGTGGATT CGGTGGAACC CCCGCCACGG TACAAGCGGG CGGTGGTCGC CGTGGTGACC
GGCGACGGCG CGGCCGAGTT GTTCGCCGAA GCGGGTGCCA CCGTGCTGCG CGCCGACGAC
GGACTGACCG CCGAGCAACT CGCCGACACG ATCCGCGGTA CCGACAGCGC CCACGTGGTG
GTGATGGCCA ACGGTGCGCT GGCATCTCAG GAACTGGTGA CGGTGGCCAC CGAGGTGCGC
TCGCCGCAGC GGTCCATCGT CACCCTGCCG ACCTCGTCGA TGGTGCAGTG CCTGGCCGCC
CTGGCGGTGC ACGATCCCGG TGAGCAGGCC GACGCCGACG CCTACGCCAT GGCCGAGGCG
GCAGCCGGAA CCCGTTGGGG TTCACTCCAA CTGGCCGATC AGAAGATGAT GACCCTGGCC
GGGATGTGCG ACGTCGGTGA CGTCCTCGGC CTGATCGGTT CCGACGTCCT CGTCGTCGCC
CCCGATCAAA CCGCCGCGGC CACCGCGCTG GTGGATCTGA TGCTCGCCAC CGGTGGTGAG
ATGGTGACCG TGCTCGCCGG TGGTGAGGTG GATCCCGCCG CGCTCGACGC CATCACCGAG
CAGATGCGGC GCAGTCATCC CGGCATCGAG CTGGCCGTGT ACGAGACCGG ACAGTGCGGT
GATCTCATCG AGGTGGGCGT CGAATGA
 
Protein sequence
MIARTMNPQV LRDWARASAE NLEALRGEIN DLNVFPIPDS DTGSNMVFTM TAAADATDAL 
PQDAGVADVA RAMADGAVAG ARGNSGIILS QVLVGLADAA ELLDGPELTF KNLVASGLRL
GSLAATRAVS EPQEGTVLTL IKVAATAAAD HESDTASDLA RAIADECADA LERTPDQLPV
LASAGVVDAG GRGFLALLDA MVSVLTGVSN RRRRYRGFLT GGGQAGHPEG ETCSDGSDMD
FEVMYQLHGA VGERIADLRR FLDDVGDAVV IVGDSSSGDG ERFSVHVHTC DPGAAVEAGV
TLGRVSDIRI SCFALDAIRA HVDSVEPPPR YKRAVVAVVT GDGAAELFAE AGATVLRADD
GLTAEQLADT IRGTDSAHVV VMANGALASQ ELVTVATEVR SPQRSIVTLP TSSMVQCLAA
LAVHDPGEQA DADAYAMAEA AAGTRWGSLQ LADQKMMTLA GMCDVGDVLG LIGSDVLVVA
PDQTAAATAL VDLMLATGGE MVTVLAGGEV DPAALDAITE QMRRSHPGIE LAVYETGQCG
DLIEVGVE
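
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a hedged example, not the tool that produced this record: the helper names `gc_fraction` and `translate` are hypothetical, and the fragment is only the first 60 nucleotides of the gene sequence above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, but an alternative start codon such as GTG is read as methionine in the first position; the start-codon set below is a deliberately reduced subset.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a reduced subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a leading start codon as M and stopping at a stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 60 nucleotides of the gene sequence in this record.
fragment = "GTGATCGCCC GGACCATGAA CCCACAGGTG TTGCGCGACT GGGCACGCGC ATCTGCGGAG"

print(translate(fragment))
print(round(gc_fraction(fragment), 2))
```

Translating the fragment gives MIARTMNPQVLRDWARASAE, which matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.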