Gene Jann_1488 details

Gene Information

Locus tag: Jann_1488
Symbol:
ID: 3933935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1457551
End bp: 1458933
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 58%
IMG OID: 637903838
Product: dihydropyrimidinase
Protein accession: YP_509430
Protein GI: 89053979
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
            [TIGR02033] D-hydantoinase
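
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1457551
end_bp = 1458933
gene_length_bp = 1383
protein_length_aa = 460

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True: 1458933 - 1457551 + 1 = 1383 bp, which is 461 codons, or 460 amino acids plus one stop codon.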


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.366627
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTCAACA CGACAGTCAT TCGTAATACG AAAGTCCTAG GCGGCGCGAC GCTGACCGCC 
ATGGACATCG CAATTTCGGA TGGGACAATC CAAGCGATCG GACCGAACCT CCCCGACGCG
GGTCAGAGTA TTGACGCCAG CGGTCTTGTT GCGACGCCAG GCGGTGTCGA TCCCCACGCC
CATATCGAGC AACGCTCGGG CATGGGGTTG ATGAATGCCG ACACGTTTGA AACTGCAACA
CGCTCTGCCG CGTTAGGAGG CACTACCAGC GTCATTTCTT TCGCAGCACA GGGGAAAGGG
CAGCGTTTGC AAGACGTTGT GGGTGACTAT ACCGCCCGCG CGAAACGTGG AGCAATGATC
GACTACGCGT TTCACCTGAG CTTATCGGAC CCTGACGTGC CGCATTTTGT TGACGATTTG
CGCGCCCTCA TTGCTGGAGG GCACAGATCT TTGAAGGTTT TCACGACCTA CGACATTGCG
CTGAACGACG ACCAGATCGA GAAGATCCTA CAAGTGGCTG GCCCGGCCGG CGCATTGACA
TGTGTTCACG CGGAAGACGA TACGACTTTA AGATCTGCTC GCAACATTTT GCTATCCCGT
CATAAATCTC TTCCCAAGCA TCACGCACAG GCGCGTCCTG AGGCCGCCGA ACGCATAGCC
GTGGCCCGGA TCTGTGCGAT GGCCGAGCGC ACGGACGCGC CGGTCATGAT TTTCCATGTC
TCCTGCGCGA GCGCCGCGAA AGCCGTCGCC GATGCCCGAG CGCGGGGCGC CCCGGTTTTT
GCTGAAACCT GCCCCCATTA CTTGTTCATG ACGGCTGACA TTCTTGACAA ACCGGGTGTC
GAGGGCGCGA AGTGGATGTG CTCTCCGCCG CAACGTACGG AAAGGGATCA AGCCGCGCTT
TGGGCCGCTT TGAATGATGG GACACTCGAT CTGATTTCAT CCGATCACGC CCCGTACCGT
TTTGATAAGA CTGGAAAACT AAAGAATGGC TTGAGCCCAC CATTTCCAGA TATCGCGAAC
GGTCTTCCCG GGTTAGAGAC CCGGCTGCCC CTGCTGTTTG ACGCGATCGT GAAACGCTCC
AAGCTTCCAT CGGCCGCATT TGCAGCCCTT ACTGCGGGCG CTGCCGCGGA TATTTATGGT
CTTCCCGGCA AGGGCCGCCT CCGACCCGGC GCGGACGCGG ATATTGTGCT GTGGGATCCC
GAGAAGTCTT ATACGTACGG CGCGGACGAC CTGCATGACA ACGTGGGCTA TAATCCCTAT
GAAGGTCATT GTGTAACTGG TTGGCCAGTA AATGTTTTCT TGCGCGGGCA GCGGATCGTG
AGAGACGGAG CGCTCACTGC CCACCCTGGC CAAGGCCGCT GGATTGATCG GAGACCCACA
TGA
 
Protein sequence
MFNTTVIRNT KVLGGATLTA MDIAISDGTI QAIGPNLPDA GQSIDASGLV ATPGGVDPHA 
HIEQRSGMGL MNADTFETAT RSAALGGTTS VISFAAQGKG QRLQDVVGDY TARAKRGAMI
DYAFHLSLSD PDVPHFVDDL RALIAGGHRS LKVFTTYDIA LNDDQIEKIL QVAGPAGALT
CVHAEDDTTL RSARNILLSR HKSLPKHHAQ ARPEAAERIA VARICAMAER TDAPVMIFHV
SCASAAKAVA DARARGAPVF AETCPHYLFM TADILDKPGV EGAKWMCSPP QRTERDQAAL
WAALNDGTLD LISSDHAPYR FDKTGKLKNG LSPPFPDIAN GLPGLETRLP LLFDAIVKRS
KLPSAAFAAL TAGAAADIYG LPGKGRLRPG ADADIVLWDP EKSYTYGADD LHDNVGYNPY
EGHCVTGWPV NVFLRGQRIV RDGALTAHPG QGRWIDRRPT
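
The relationship between the gene sequence, the protein sequence and the GC content field can be reproduced with a short script. This is a hedged sketch rather than the method used by the database: the helper names (clean, translate, gc_percent) are hypothetical, and the codon lookup uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). Only the first 30 bases of the gene sequence are used as input here; the full sequence can be pasted in the same 10-base block layout.

```python
# Codon lookup in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon; raises ValueError on non-ACGT bases."""
    seq = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTTCAACA CGACAGTCAT TCGTAATACG"
print(translate(first_blocks))  # MFNTTVIRNT
```

The output matches the first ten residues of the protein sequence above. Running gc_percent on the full gene sequence would give a value to compare against the GC content field.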