Gene Jann_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2068 
Symbol 
ID3934521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2077362 
End bp2078759 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content65% 
IMG OID637904424 
Producturacil-DNA glycosylase superfamily protein 
Protein accessionYP_510010 
Protein GI89054559 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.803704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.600316 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAGAG GCGTCGCGCC CGCCGAGGTG ACCTGGGGCG GCACTGATAC GCCGCGCGGG 
CTGTTTGATG AGCCGTCATC CGTGGCGCAA AGCGGCGACA CGAGCGTTCC GCGCAGCTTC
ATTTCCATGG CCGACAGTGT CGTCTGGCAC AGCGATCCGT CCCGCTTCGC GTGGCTCTAT
GCGTTTTTGT GGCGGCTGCG CGACGCCCCG CATCTGATGA CAGATCGCGG TGACGCCGAC
CTTGCCCGTC TGCGCGCGAT GGAGAAGAAC GTGCACCGCT GCCAGCACAA GATGAAGGCT
TTCGTGCGTT TCCGCGACAT CGGTGAGGCG GAGACCCCCC GCCGGTCCTT TGCCGCCTGG
TTCGAGCCGA CCCATCACAC GGTGGAGCCC ACGGCGGGCT TTTTCCAACG ACGTTTCGCC
GATATGGACT GGCGCATCAT CACGCCCGAC ATTTCAGCCA TTTTCGAAGG TGGCACGCTG
CGGTTCATTG AGGATCAGCC CAAACCGGGC CTGCCCGATG ACGCGAGCGA GGCGCTGTGG
ATCACGTATT ATCGCAACAT CTTCAATCCG GCGCGCTTGA AGGTGCAGGC GATGCAGTCC
GAGATGCCAA AAAAGTACTG GAAGAACCTG CCGGAGGCCG CTGCGATCCC GGATCTGATC
GCCACCGCGC CCGCCCGTGC CCGCGCGATG GCCGAGGCCG CGCCGACCTT GCCGCCAACC
CGCATGGCCT CCGCGCAGGA GCAGCAGCGC GCGTTTGCAT CGTCTTGGGA GGGCTCGGAT
GATGCGTTTC TGGCAGCGGT GAAGGGCTGC ACGCGCTGTC CGCTCCATCG ACACGCCACG
CAGACCGTGC CCGGGGAAGG GCCGGCCAAG GCCGCGCTGA TGATCGTGGG GGAGCAGCCG
GGCGATCAGG AGGATTTGCA GGGTCGCCCC TTCGTGGGGC CCGCGGGTCA CGTGTTCGAT
CAGGTCGCGG CGGAGGTGGG GTTGGACCGC GCAACCGCCT ACATCACCAA CGCCGTGAAG
CATTTCAAGT TCGTGCCACG GGGCAAGCGG CGTTTGCATC AGCGGCCCAA TGCGGGGGAG
GTCGCCTATT GCAAATGGTG GCTGGAGGCA GAGATTGCGC GCGTGACCCC CAAGCTGATC
CTGGCCATGG GGTCCACCGC GGCGCTTGCA TTGACCGGGT CGGGCAACAA CCTGCTGAAA
CGGTGCGGGA CAATTGAAGC CGTCGCCGGG CGACCACCTG TCCTGATCTC TTTGCACCCC
TCATACATCT TGCGGATCAA GGATGCCGAT CAGCGGGCGG AGGCCCGGCA GATGTACCAG
CGTGATCTCG GGCGCGCCAC ACAGATGGTG CAGGAGCGGG CCGGGCCGAT CGGTCTCGCA
GAAAACGGGC CGGAGTGA
 
Protein sequence
MARGVAPAEV TWGGTDTPRG LFDEPSSVAQ SGDTSVPRSF ISMADSVVWH SDPSRFAWLY 
AFLWRLRDAP HLMTDRGDAD LARLRAMEKN VHRCQHKMKA FVRFRDIGEA ETPRRSFAAW
FEPTHHTVEP TAGFFQRRFA DMDWRIITPD ISAIFEGGTL RFIEDQPKPG LPDDASEALW
ITYYRNIFNP ARLKVQAMQS EMPKKYWKNL PEAAAIPDLI ATAPARARAM AEAAPTLPPT
RMASAQEQQR AFASSWEGSD DAFLAAVKGC TRCPLHRHAT QTVPGEGPAK AALMIVGEQP
GDQEDLQGRP FVGPAGHVFD QVAAEVGLDR ATAYITNAVK HFKFVPRGKR RLHQRPNAGE
VAYCKWWLEA EIARVTPKLI LAMGSTAALA LTGSGNNLLK RCGTIEAVAG RPPVLISLHP
SYILRIKDAD QRAEARQMYQ RDLGRATQMV QERAGPIGLA ENGPE