Gene Jann_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1997 
Symbol 
ID3934450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1997294 
End bp1998562 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content61% 
IMG OID637904353 
Productphage integrase 
Protein accessionYP_509939 
Protein GI89054488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.559264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.395324 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATTG CAAAGAGAGG CCGTCTCTAT CACCTCCGAC GCCGCGTCCC GCGCCGGTAT 
TGCGGGGTTG AGCCGCGCGA AACCGTGTGG ATCAGCCTGC ACACTGACTC TGAGACAGTG
GCCATGAGCA AGGCGGACCG CGCATGGAGC CAGATGATTG AGGCTTGGGA AGCACGTTTG
GCCGGGAACA GTGACGATGC AGAGGCGCGA TACGAGGCAG CGCGTGACCT GGCTCGGGTT
CGAGGCTTTC GGTATCTGGA CGTCGGTGCC GTCGCGAAGT TACCTGTAGA AGACGTTGTC
GAGCGTGTGG AAGCAATTCC AGCCACGATG GATCAACTGG ACGCCATTGA GGGCGCTGCT
CTTCTTGGAG CGGCTCCTGA GCCTTGTACA ACGGTCACAA AGACGCTAGA GCTATACTGG
ACGCTTGCCC GTGAGAAGAC CTTTGGCAAA AGCGAAGACC AACTGCGCCG TTGGGAGGCG
CCCCGCAAGA AGGCTATCAA GAACTTCGTT GCCATCGTCG GCGACAAGGA CATCGCCAAC
ATCACCCGCG ACGACATGCT GGACTTCCGC CAGCACTGGC TCGACCGGAT CGAGGCCGGC
GAGGTCACGG CGAACTCGGC CAACAAGGAC CTGATCCATC TCGGCGACGT GCTTAAGACC
GTGAACACGA TGAAGCGGTT GGGGCTCATG CTGCCCTTGG GCGAGTTATC CTTCAAGCAG
GGTGAGGCGC GAACCCGCCC ACCGTTCAGC GAAGACTGGA TCACAACGCG GTTGCTGGCC
CCGGGCGCGC TGGACGGATT GAACGACCAA GCGCGCGGCA TCTTACTGGG GATGGTGAAC
ACCGGCTATC GCCCATCCGA GGGGGCCGCG TTGACGGCAG ACACGATCCG GCTCGATTGC
GACGTGCCGC ATATTTCGAT TGAAGCTGAT GGCCGTCAGC TGAAGTCACA CTTCGCCCGG
CGGGTGATCC CTCTGGCTGG CGCTTCGTTA GAGGCCTTCA AGCAATTCCC TGACGGCTTC
CCCCGCTACC GCAACAGCGC CAGCCTAAGC GCGGTGGTTA ACAAGTTCCT CCGCACCAAC
GGCCTGCTCG AAACCCCGCG TCACTCATTT TACTCGCTCC GGCACTCCTT CGAGGATCGC
ATGCTCGCCG CCGGGATCGA CGACCGGATA AGGCGGGATT TGTTTGGTCA TCGATTAGAT
CGGGAACGGT ACGGAAAAGG TGCGTCGCTC GAACATGTCG CCGAACTCGT CGGTCGGGTC
GCTTTCTGA
 
Protein sequence
MGIAKRGRLY HLRRRVPRRY CGVEPRETVW ISLHTDSETV AMSKADRAWS QMIEAWEARL 
AGNSDDAEAR YEAARDLARV RGFRYLDVGA VAKLPVEDVV ERVEAIPATM DQLDAIEGAA
LLGAAPEPCT TVTKTLELYW TLAREKTFGK SEDQLRRWEA PRKKAIKNFV AIVGDKDIAN
ITRDDMLDFR QHWLDRIEAG EVTANSANKD LIHLGDVLKT VNTMKRLGLM LPLGELSFKQ
GEARTRPPFS EDWITTRLLA PGALDGLNDQ ARGILLGMVN TGYRPSEGAA LTADTIRLDC
DVPHISIEAD GRQLKSHFAR RVIPLAGASL EAFKQFPDGF PRYRNSASLS AVVNKFLRTN
GLLETPRHSF YSLRHSFEDR MLAAGIDDRI RRDLFGHRLD RERYGKGASL EHVAELVGRV
AF