Gene Jann_1474 details


Gene Information

Locus tag: Jann_1474
Symbol:
ID: 3933921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1443259
End bp: 1444635
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 62%
IMG OID: 637903824
Product: cytosine deaminase-like protein
Protein accession: YP_509416
Protein GI: 89053965
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
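
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein; neither convention is stated in the record itself, and the variable names are only illustrative.

```python
# Values copied from the record above.
start_bp = 1443259
end_bp = 1444635

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1377 0 458
```

Under these assumptions the result agrees with the reported gene length of 1377 bp and protein length of 458 aa.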


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.534855
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTCA AGACGCTCCC CTCTGACAAT CTGCGGTTGG AGAACGTGAC GATTCCTGGG 
TGCCTGATCG GGCAATCGGG CGTTGTGCGC ACAGCGCTGA GTATCGCCGA CGGTGTGATC
GCCGCGAGTG GTGGCACGCC GGTCGATATG AGGGGGGCGA TGGTGTTTCC CTGTTTCGTC
GACATGCACA CGCATCTGGA CAAGGGCCAT ATCTGGCCCC GCTCGCCCAA TCCCGACGGC
ACGTTCATGG GGGCGCTGGA GACGGTGCGC GCAGACAAAT CGGGGCGGTG GGACAGTGTT
GATCTGCCGC CGCGCATGAA TTTCAGCCTG AACTGCGCCT ACGCCCACGG CACGCGGGCC
ATTCGAACCC ATCTGGACTA TTTTGGAGAG ACTGACGAGG CGCGGGTCAG CTGGGACGTC
TTCGCACAGA TCCGTGACGA CTGGGCCGGT CGGATTGATC TTCAGGCCGC TGTGTTGACG
GGGATCGACA TGGCCGCTGA TGCCGGCGCT TTGGCAACTT GCGCAGACCT TGTGGCGTCC
CATGGCGGCG CTCTCGGGGC CGTGACCTAT CCGGAGCCGG ACCTGCGCGC TTGGTTAACT
GCATACTTTG AGGCGGCTGC GCTGCGTGGG ATGGACCTGG ATTTCCACGT GGATGAGACA
ATGGATCCAG AGGTTAACAC CCTTAAGGAT ATTGCGGAAA TTGTGCTAGA GACAGGATTT
AAAGGGAAAA TCACCGTGGG CCATCTCTGC TCCCTGTCGG TGATGGAAGA CGCGGTGGCC
ATGGCCACTC TTGATCTGGT CGCCAAGGCC GGGCTCGATG TCGTCAGTCT GCCGATGTGC
AACCTGTATC TGCAAGACCG CCACGCCGCG CGCACGCCGC GAGGCCGGGG CATCACTTTG
GTGCACGAGA TGAAGGCGCG GGGCATCAAC GTCAGTTTCG CCTCAGACAA CACCCGAGAT
CCGTTCTATG CCTACGGTGA TATGGACATG ATCGAGGTGA TGCGGGAGGC CACGCGCATT
GGCCATCTGG ACCACTCCGA TGACGATTGG ACCCATGCGT TTCTGGGCAA TCCTGCCCGG
GCCTGTGGCG TCACGGCACC GTCGTTGATG CCCGGAGCAC CCGCCGATTT GGTGATTTGC
CGCGCCCGCG AATGGACGGA ACTTTTCGCC CGCCCGCAGG CTGACCGGAT CGTGCTGCGT
GATGGGCGCC AGATTGATCG CGCTTTGCCG GATTACGCCG AATTGGATTA CCTTATGACG
CCCTCAAGCA GCGAAGCGGT GGGGCAGAAA GCATCGCCCT CAAGCAGCGA AGCGGTAGGG
CAGCACGCAT CGCCCTCAAG CAGCGACGCG GTCGGGCAGG GAGAGATCGC CAAATGA
 
Protein sequence
MDFKTLPSDN LRLENVTIPG CLIGQSGVVR TALSIADGVI AASGGTPVDM RGAMVFPCFV 
DMHTHLDKGH IWPRSPNPDG TFMGALETVR ADKSGRWDSV DLPPRMNFSL NCAYAHGTRA
IRTHLDYFGE TDEARVSWDV FAQIRDDWAG RIDLQAAVLT GIDMAADAGA LATCADLVAS
HGGALGAVTY PEPDLRAWLT AYFEAAALRG MDLDFHVDET MDPEVNTLKD IAEIVLETGF
KGKITVGHLC SLSVMEDAVA MATLDLVAKA GLDVVSLPMC NLYLQDRHAA RTPRGRGITL
VHEMKARGIN VSFASDNTRD PFYAYGDMDM IEVMREATRI GHLDHSDDDW THAFLGNPAR
ACGVTAPSLM PGAPADLVIC RAREWTELFA RPQADRIVLR DGRQIDRALP DYAELDYLMT
PSSSEAVGQK ASPSSSEAVG QHASPSSSDA VGQGEIAK
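
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It builds the codon table from the compact amino-acid string used in the NCBI genetic code listings; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may start translation, so the same string is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of any tool behind this record. Only the first line of the gene sequence is translated in the example.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention of the NCBI tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = "ATGGATTTCA AGACGCTCCC CTCTGACAAT CTGCGGTTGG AGAACGTGAC GATTCCTGGG"
print(translate(first_line))  # prints: MDFKTLPSDNLRLENVTIPG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 62%.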