Gene Gbro_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbro_0042 
Symbol 
ID8549369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGordonia bronchialis DSM 43247 
KingdomBacteria 
Replicon accessionNC_013441 
Strand
Start bp52016 
End bp53803 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content62% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003271293 
Protein GI262200085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTC ATCCCAGCAT CAACCACTCT GAAGTCCACA CCAGCGACAG GATCTTCACT 
CAAGCAGTGG TCGTCGTGAA CCGCTCCGGT GCCGCAGCGA TGATCGAGGC GTGGCGAGCC
GAGAGCGTGG CCGCACCCCG CGGACCGTTC CGAGGGGCCC TGTCCTACAC AGTCGAAAGC
GTCCTCGTCG CCCTTGCCTG CGTGCTGCTC AGACGAGCCG AACCTACGGT CCGCACGATT
TTCCGCACCC TGCTCGACTT CACCCCTGAC CAGCTTGCCC AGCTCGACAT CGCCGACGCC
GATCTCACCG CCATCAGGTC CGATGCTGAC CGAGCCTTGA AAAGCTTCAG GAACCGGCTT
GATCGAACAC TGGCCTGCCT GGACTCGGCA CGCGATCAGC CCGCAGTACG AATTCCCGTC
GCCCAGCACA AGGCGATCAT CCGAGCCCGA ACCCCTGAGC AACAAGAAGC ATACGCGATA
GCGGCGCAAC GACTCTCGAC TGTCGTCAAT CGCATTCTGG CCGGCTCGAT TCCCACCGAA
TACCAACAAA GCGGCCGCGG CGATGTCGTC GTCGACGAGA CCATCATCGA CACGTCAGAT
GGGACCTACG ACCTTGGCGT CACCGACGAC CGCAACCGGT CGGCCATCTA CTTCGGTGGC
TACTACCGAA GGGACTATCG CAATCGCGTC GACGCCGAGG GGAATCCGCT GACGAAGAAA
CGGGCGTGGG GAATCGGTGT CACGGCGGTC AGCAGTGTCG GGCCGCCCGA TGCCTTGCAT
AACCGGCCGA TCCTGTTCAC CGGCATAGCT ATTCACCCAC CGACGTCGGG ATCACTCGAA
GGGCTCGACG AAGCCATCGA GCATCATCAA CGCAACGGGT TCGACAGTCG ATCCGCAAGC
CGAACCGCCA GATGGCCGCT GATCACCTTC GACATGGGCT ATTTGAAAGA CGGCCTCGAT
CGATGGCAAT TTGATCGCAA ATATGCTGGC GTCTTCCGCT ATCCCGACCA CTGGCGACGC
GACTTCGAGA GCGTTCCCGC CCAGCCCGGC GGCGCCAAAC CGGGACCCGT CCAACTTTCG
GGTGCTTGGT ACTGCCCGAC AGCTGCCGGC ATCTCCCTGG GCAAGAACTA CGTCAAGCCC
TTGCGCGACG TGCTCAATAA GGACGAATGG GAAGCTCGCG AACGCCGACT TCGTCAGCTC
CTGCCACGGC TGATGGGGGT CGACAGACGA CTGCTTGAAC GCAATACTCG ACCAGGCCGC
CCCGCCGAAG GTACTCAGCC TGCTAAATCC GTCAAGCTGG TGCTGACCTG CCCGGCAAGC
ATCGGCAACG TGAGATGCGC GAGGTGGCAC AACGCCGAGA CCGAGGACCG GCTCGACCTG
CCCTACATCG AACCCGAGCC CGACATGCCG TACTTCCCGT GTTGCACACA GCGCAGCGTC
ACAATCACGC TGACCGACGA CCAACGAAAG CGTCAACAGC TCAGTCAGTG GGCACCCGGA
TCCAACGACC ACGCGATCTA CCACGAGGCT GCCCGCGCGC TCACCGAGCA ACGCTTCAAT
CTGATCAAAT CGCGGACAGT CGCCGGCCTG GTCCATCTCA AGTACGGGCC GCGCCGCGAA
CCGCTGGTCA AGCTCATCAT CGCGATGGCG TTCGCCGTCG TGAACGTTCG CGAGATCGAG
CGATTCGAAT CGTCAAACCG TGACCTCCCC GAATCAATCG CCGCGAAATG GCGCCGACTC
GAAGCGGACC TCGGACAGCC GCCGATCCGA ATGCCCAACC GCACATGA
 
Protein sequence
MSIHPSINHS EVHTSDRIFT QAVVVVNRSG AAAMIEAWRA ESVAAPRGPF RGALSYTVES 
VLVALACVLL RRAEPTVRTI FRTLLDFTPD QLAQLDIADA DLTAIRSDAD RALKSFRNRL
DRTLACLDSA RDQPAVRIPV AQHKAIIRAR TPEQQEAYAI AAQRLSTVVN RILAGSIPTE
YQQSGRGDVV VDETIIDTSD GTYDLGVTDD RNRSAIYFGG YYRRDYRNRV DAEGNPLTKK
RAWGIGVTAV SSVGPPDALH NRPILFTGIA IHPPTSGSLE GLDEAIEHHQ RNGFDSRSAS
RTARWPLITF DMGYLKDGLD RWQFDRKYAG VFRYPDHWRR DFESVPAQPG GAKPGPVQLS
GAWYCPTAAG ISLGKNYVKP LRDVLNKDEW EARERRLRQL LPRLMGVDRR LLERNTRPGR
PAEGTQPAKS VKLVLTCPAS IGNVRCARWH NAETEDRLDL PYIEPEPDMP YFPCCTQRSV
TITLTDDQRK RQQLSQWAPG SNDHAIYHEA ARALTEQRFN LIKSRTVAGL VHLKYGPRRE
PLVKLIIAMA FAVVNVREIE RFESSNRDLP ESIAAKWRRL EADLGQPPIR MPNRT