Gene Jann_2986 details

Gene Information

Locus tag: Jann_2986
Symbol: deoA
ID: 3935456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3005149
End bp: 3006459
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 68%
IMG OID: 637905356
Product: thymidine phosphorylase
Protein accession: YP_510928
Protein GI: 89055477
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02643] thymidine phosphorylase
            [TIGR02644] pyrimidine-nucleoside phosphorylase
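
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 3005149
end_bp = 3006459
gene_length_bp = 1311
protein_length_aa = 436


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1311
print(expected_protein_length(gene_length_bp))  # 436
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields of the record.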


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.433587
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG CGCGCGCCAT TCTGACGAAG CTGCGCCAGG GCGACCGCCT GACGGAGGCA 
GAGGTGTTCT GGTTCGCCGA AGGGCTTGCG ACCGGCGACG TCACCGACGC TCAGGCAGGC
GCGTTTGCCA TGGCCGTGTG CCAAAACGGA CTGGGCGAGG AGGGCCGGGT GCAACTGACC
CGCGCGATGC GAGAGACGGG CCGCGTGATG GCATGGCACC TCGACGGGCC GGTGATCGAC
AAACATTCAA CGGGTGGCGT AGGGGATTGC GTGTCGCTGC TGCTGGCACC TGCGCTGGCC
GCTTGCGGAG CGTTCGTGCC AATGATTTCT GGGCGCGGTT TGGGGCACAC GGGCGGCACG
CTGGATAAGT TGGAGGCCAT TCCGGGCTAC AACACAGACG TCTCCCCCGA TGATCTGCAA
GAGATCGTGG CTGATATCGG CTGTGCCATC GTGGGCGCAT CGGGTGATAT CGCGCCCGCT
GACAAGCGGC TTTACGCGGT GCGGGACGTG ACGGCCACCG TCGCCTCGGT CGATCTGATC
ACGGCGTCGA TCCTGTCAAA AAAGCTCGCC GCCGGGTTGG AGGCATTGGT TCTGGATGTG
AAGGTCGGCT CGGGCGCGTT CATGGGCACG GAGGCGGAGG CGTTGGGCCT GGCGCAAGCG
CTGGTCGCGA CGGCACAAGG CGCGGGGTGC ATGACCACGG CGTTGGTCAC CGACATGAAC
CAACCCCTGG CCAGCAGCGC GGGCAATGCG TTGGAACTGG CCGAGGTGAT GCAGGTTTTG
ACCGGAGCGG CGAAGGATAC GGCCCTGGAG CACCTGACCG TTGCATTGGG CGGAGAGGTC
CTGGCCCTGG GCGGTCTGGC GGCGGATGCG AGCGATGGCG AGGGCCGGAT CAGACGCGCG
CTGGCAGGCG GAGAGGCCGC GCGGGTCTTC GCAGAGATGG TGGCCGAACT GGGCGGCCCG
GTCGATTTCG TGGAGCGCTG GCCCGACAGG TTGCCGGCCG CGCCGGTGAT GATGGATGTG
CATCCGGGAC AGGCGGGATA CGTCACCGCC ATCGACACCC GCGCCCTGGG AGAGATCGTG
GTGCATCTGG GCGGCGGCCG CCTGCGGGAG GACGACCGGA TCGACCCGGC GGTTGGCCTG
TCGGACATCG CGCGGCTGGG CACGCGGGTG GACGATGTGA CACCCCTTGC GCGCATGCAC
ACAGCCGATG AGGACGAAGG CCGCGCGCTG GCCGCCAAGC TGCGCCGCGC ATTCACCCTG
TCGGACGCTG CGATAGATAC GCCGCCCCTG ATCCATGAGA GGATTGCCTG A
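
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before it can be analysed. As a minimal sketch, the GC content reported in the record could be recomputed as follows. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above (60 bases).
first_row = "ATGAGCGACG CGCGCGCCAT TCTGACGAAG CTGCGCCAGG GCGACCGCCT GACGGAGGCA"
print(f"{gc_content(first_row):.1f}% GC in the first {len(clean_sequence(first_row))} bases")
```

Passing the full 1311 bp sequence to `gc_content` instead of `first_row` would give the value to compare against the GC content field.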
 
Protein sequence
MSDARAILTK LRQGDRLTEA EVFWFAEGLA TGDVTDAQAG AFAMAVCQNG LGEEGRVQLT 
RAMRETGRVM AWHLDGPVID KHSTGGVGDC VSLLLAPALA ACGAFVPMIS GRGLGHTGGT
LDKLEAIPGY NTDVSPDDLQ EIVADIGCAI VGASGDIAPA DKRLYAVRDV TATVASVDLI
TASILSKKLA AGLEALVLDV KVGSGAFMGT EAEALGLAQA LVATAQGAGC MTTALVTDMN
QPLASSAGNA LELAEVMQVL TGAAKDTALE HLTVALGGEV LALGGLAADA SDGEGRIRRA
LAGGEAARVF AEMVAELGGP VDFVERWPDR LPAAPVMMDV HPGQAGYVTA IDTRALGEIV
VHLGGGRLRE DDRIDPAVGL SDIARLGTRV DDVTPLARMH TADEDEGRAL AAKLRRAFTL
SDAAIDTPPL IHERIA
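
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGACG CGCGCGCCAT TCTGACGAAG"))  # MSDARAILTK
```

The ten residues printed match the first block of the protein sequence in the record. Translating the full gene sequence the same way should yield the complete protein, ending just before the terminal TGA stop codon.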