Gene Gobs_4539 details

Gene Information

Locus tag: Gobs_4539
Symbol:
ID: 8756237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 4760537
End bp: 4761628
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: thiamine pyrophosphate protein domain protein TPP-binding protein
Protein accession: YP_003411460
Protein GI: 284992906
COG category:
COG ID:
TIGRFAM ID:
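
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4760537
END_BP = 4761628


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1092
print(protein_length(length_bp))  # 363
```

Both results match the Gene Length and Protein Length fields listed for Gobs_4539.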


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0552046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCG TCGAACTCGG CATGCCCGCC GAGGGCCCCA TTGCCGACGC CATCGCCCGC 
TCCCTGGAGA CCCAGGGCGA GAAGCGCGCC GAGCTCAAGG CCAAGGACCT CAAGACCGAC
CAGGAGGTGC GCTGGTGCCC GGGGTGCGGT GACTACGTCA TCCTCAACGC GGTCCAGAGC
TTCCTGCCCA GCCTCGGCAT CGCGCGCGAG GACATGGTCA TCGTGTCGGG CATCGGCTGC
TCGTCGCGCT TCCCGTACTA CATGAACACC TACGGGATGC ACTCGATCCA CGGCCGCGCG
CCGGCGATCG CGACCGGTCT CGCCGCGTCC CGCCCGGACC TGTCGGTGTG GGTCGTCACC
GGGGACGGCG ACGCGCTGTC CATCGGCGGC AACCACCTGA TCCACACGCT GCGCCGCAAC
GTGAACCTGA AGATCCTGCT GTTCAACAAC CGGATCTACG GGCTCACCAA GGGCCAGTAC
TCGCCCACCA GCGAGGTCGG CAAGGTCACC AAGTCCACCC CGATGGGCTC GCTGGACCAC
CCGTTCAACC CGGTGTCGCT GGCGCTGGGC GCCGACGCCA CCTTCGTCGG CCGGGCCATG
GACTCCGACC GCAAGGGCCT CACCGAGGTG CTGCGGCAGG CCGCCGAGCA CCAGGGCACC
GCGCTGGTGG AGATCTACCA GAACTGCAAC ATCTTCAACG ACGGCGCGTT CGACCTGCTC
AAGGACCCGA CCACCGGCGT GCAGTGGTCC ATCCCGCTGG TCCACGGCCA GCCGCTGGTG
TTCGGTCCGG ACGGCGCCTC CTGCGTGGTC CGCGACGACT TCGGCGGCCT GCGGATCGCC
GAGACCAACC AGGTGGACGC CGACGACATC GTCGTGCACG ACGCCACCCG GGAGGACCCG
TCGTACGCCT TCGCGCTGTC GCGGCTGTCC AGCCAGGACC TCCGCTACAC CCCGATGGGC
GTCTTCCGGT CGGTGCCGAA GCCGACCTAC GACAAGATGA TGGCCGACCA GGTCGAGGAG
GCCCGCACCT CCTCGCCGGC GGACCTGGGC GCGCTGCTCG CCGGCAACGA CACCTGGACC
GTCTCGGCCT GA
 
Protein sequence
MTTVELGMPA EGPIADAIAR SLETQGEKRA ELKAKDLKTD QEVRWCPGCG DYVILNAVQS 
FLPSLGIARE DMVIVSGIGC SSRFPYYMNT YGMHSIHGRA PAIATGLAAS RPDLSVWVVT
GDGDALSIGG NHLIHTLRRN VNLKILLFNN RIYGLTKGQY SPTSEVGKVT KSTPMGSLDH
PFNPVSLALG ADATFVGRAM DSDRKGLTEV LRQAAEHQGT ALVEIYQNCN IFNDGAFDLL
KDPTTGVQWS IPLVHGQPLV FGPDGASCVV RDDFGGLRIA ETNQVDADDI VVHDATREDP
SYAFALSRLS SQDLRYTPMG VFRSVPKPTY DKMMADQVEE ARTSSPADLG ALLAGNDTWT
VSA
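
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, and it ignores table 11's alternative start codons, which is harmless here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence above.
print(translate("ATGACCACCG TCGAACTCGG CATGCCCGCC"))  # MTTVELGMPA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field after rounding.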