Gene Gobs_4398 details

Gene Information

Locus tag: Gobs_4398
Symbol:
ID: 8756092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 4630718
End bp: 4631995
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_003411324
Protein GI: 284992770
COG category:
COG ID:
TIGRFAM ID:
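
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4630718
END_BP = 4631995


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1278 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields of the record.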


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCTTCG ACGCGATCGA CGTCATCCGC ACCAAGCGGG ACGGCGGGCG GCTGAGCGCC 
GAGCAGATCC GCTGGGTGAT CGACGCCTAC ACCCGCGGCG TCGTGCCCGA CGAGCAGGTC
AGCGCGCTGC TCATGGCGGT GTTCTTCCGC GGCATGGCGC CCGAGGAGCT CGCCGTCTGG
ACGCAGGCGA TGATCGACTC CGGCGAGCGC AAGGACCTGT CCCCGCTCGG CCGGCCGACC
GCCGACAAGC ACTCGACCGG CGGCGTCGGC GACAAGACCA CGCTGCCGCT CGCGCCGCTG
GTCGCCGCCT GCGGCGTCGC GGTGCCGCAG CTGTCCGGGC GGGGCCTGGG CCACACCGGC
GGCACCCTGG ACAAGCTCGA GTCGATCCCC GGCTGGCGGG CCGACGTCCA CGAGGAGGCC
TACCTGGCGC AGCTGCGCGA GGTGGGCGCG GTCATCTGCG CGGCCGGCAA CGACCTGGCG
CCGGCGGACA AGAAGCTGTA CGCGCTGCGC GACGTCACCG GCACCGTCGA GTCGATCCCG
CTGATCGCCA GCTCGATCAT GAGCAAGAAG ATCGCCGAGG GTGCCGACGC CCTGGTGCTC
GACGTGAAGA CCGGGTCGGG CGCGTTCATG AAGGACCCCG AGGCATCCCG CGAGCTGGCC
CGCACCATGG TCGGCCTCGG TGAGGCGGCC GGCGTGCACA CGGTCGCGCT GGTGACCGCG
ATGGACCGGC CGCTGGGTCG CGCCGCCGGC AACGCCGTCG AGGTGGCCGA GTCGGTGGAG
GTGCTCGCCG GCGGTGGCCC GGCCGACGTC GTCGAGCTGA CCCTGGCGCT GGCCCGCGAG
ATGCTGGCCG GCGTCGGGCG GGGCGACGTC GACCCGGCCG AGGCCCTCCG CGACGGGCGG
GCCATGGACG TCTGGCGGCG GATGATCAGC GCGCAGGGCG GCGACCCCGA CGCGCCGCTG
CCGCACCCGG CCGAGAAGCA CGTGGTGGTC GCTCCGGCCA CCGGCACGCT GACCCGGCTG
GACGCCTACG CCTTCGGCGT GGCCGCCTGG CGGCTGGGTG CGGGGCGGGC GCGCAAGGAG
GACCCGGTGT CCGCGGCGGC CGGTGTCACC TGGACCGCCG GGGTGGGCGA GCAGGTGGTG
GCCGGTCAGC CGCTGCTGGA GCTGCACACC GACGACCCGG ACCGCATCCC GCGGGCGCTG
GAGGCGCTGG AGGGCGCGGT CGGCGTCGAC ACCGGTGAGC AGCCCCTGCC GCTGCTGCTG
GACCGCATCA CGGCCTGA
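
A GC content figure such as the 75% reported for this gene is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as shown above (blocks of ten bases separated by whitespace). The function name `gc_content` is illustrative, and the short example uses only the first ten bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence above: 6 of 10 are G or C.
print(gc_content("GTGACCTTCG"))  # 0.6
```

Applying the same function to the full 1278 bp sequence is how the record's rounded percentage could be reproduced.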
 
Protein sequence
MTFDAIDVIR TKRDGGRLSA EQIRWVIDAY TRGVVPDEQV SALLMAVFFR GMAPEELAVW 
TQAMIDSGER KDLSPLGRPT ADKHSTGGVG DKTTLPLAPL VAACGVAVPQ LSGRGLGHTG
GTLDKLESIP GWRADVHEEA YLAQLREVGA VICAAGNDLA PADKKLYALR DVTGTVESIP
LIASSIMSKK IAEGADALVL DVKTGSGAFM KDPEASRELA RTMVGLGEAA GVHTVALVTA
MDRPLGRAAG NAVEVAESVE VLAGGGPADV VELTLALARE MLAGVGRGDV DPAEALRDGR
AMDVWRRMIS AQGGDPDAPL PHPAEKHVVV APATGTLTRL DAYAFGVAAW RLGAGRARKE
DPVSAAAGVT WTAGVGEQVV AGQPLLELHT DDPDRIPRAL EALEGAVGVD TGEQPLPLLL
DRITA
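
The protein sequence corresponds to a translation of the gene sequence with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is an accepted start codon and is read as methionine at the first position. The sketch below is a minimal, dependency-free illustration of that step, not the database's own pipeline. The name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI ordering of bases (T, C, A, G).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial table-11 start codon as M and
    stopping at the first stop codon. Whitespace and case are ignored."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCTTCG ACGCGATCGA CGTCATCCGC"))  # MTFDAIDVIR
```

The output matches the first ten residues of the protein sequence. The final codons of the gene, GAC CGC ATC ACG GCC TGA, translate the same way to DRITA followed by the stop.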