Gene Jann_3521 details


Gene Information

Locus tag: Jann_3521
Symbol:
ID: 3935996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3578679
End bp: 3579947
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 62%
IMG OID: 637905896
Product: phage integrase
Protein accession: YP_511463
Protein GI: 89056012
COG category:
COG ID:
TIGRFAM ID:
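
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3578679
end_bp = 3579947

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1269 422
```

Both values agree with the Gene Length and Protein Length fields of the record.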


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00222857
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCATTG CCAAACGAGG TCGCACATAC CACCTCCGTC GCCGGGTTCC GCGTCGGTAT 
CGCAGGGTTG AACCGCGTGG AACGGTCTGG ATCAGCCTGC ATACGGACTC GGAGACGGTT
GCCCGGAGCA AGGCCGACCG GGCGTGGAGC CAGATGATTG AGGCGTGGGA GGCGCGGCTG
GCCGGGAACA GCGCAGACGC GGAGGCGCGA CACGAGGCGG CGCGCGATCT GGCCCGTACG
CGGGGCTTTG GATACTTGGA TGCAGGCGCT GTAGCAAAGC TGCCAGTCGA GGATGTCGTC
GAGCGGGTTG AAGCGATCCC GGTGCCGGCA AAGCAGCCCG ACCCGGTTGA AGCCGCCACA
CTTCTCGGCA CGGTCCCTAA GCCGCGCACC ACGGTCACCA AGGCGCTGGA GCTTTACTGG
ACGCTGGCCC GTGAGAAGAC CTTCGGCAAA AGCGAGGACC AAATGCGCCG CTGGGAAGCC
CCCCGGAACA AAGCCATCAA AAATTTTGTG TCGGTCGTCG GCGACAAGGA AATCGCCAAC
ATCACCCGCG ACGACATGCT GGACTTTCGC CAGCATTGGC TTGATCGGAT CGAAGCAGGT
GAGGTCACGG CCAATTCCGC TAACAAGGAT CTGATCCATC TCGGCGATGT GCTGAAGACC
GTGAACACGA TGAAGCGGCT TGGGCTGGAT CTTCCGTTGG GTGAGTTGTC GTTCAAACAG
GGTGAGGCTC GGACCCGTCC ACCATTCAGT AATGATTGGA TCACGACGCG CCTGCTTACG
CCCGGTGCGC TCGACGGGTT GAACAACGAA GCGCGGGGCC TACTGCTTGG TATGGTGAAC
ACAGGCTACC GCCCGTCCGA GGGCGCCGCG CTGACAGTGG ACACGATCCG GCTCGATTGC
GACGTGCCGC ATATCTCCAT CGAACCCGAC GGGCGGCAAC TCAAGTCGCA CCATGCTCGC
CGAGTCATTC CCCTAACCGG TGTCTCCCTG AAGGCATTTG AGCAATTCCC CGAGGGCTTC
CCTCGCTACC GCAACCGAGC CACGCTCAGC GCGGTTGTTA ACAAGTTCCT CCGCACCAAC
GGCCTGCTCG AGACGCCGCG CCATTCCTTT TACTCGCTGC GTCATTCTTT TGAGGACCGC
ATGCTCGCTG CCGGGATCGA CGACCGGATA AGGCGTGATC TGTTTGGTCA TCGGTTGGAT
CGGGAACGGT ACGGCAAGGG CGCGTCGCTG GAACATGTGG CCGAGCTCGT CGCTCGCATC
GCCTTCTGA
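
The GC content reported for this gene can be recomputed from the sequence itself. The sketch below shows one way to do this, assuming the sequence is pasted as plain text in blocks of ten bases as above; `gc_content` is a hypothetical helper name, and only the first two blocks of the gene are used as a short demonstration.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAGCATTG CCAAACGAGG"))  # 0.5
```

Applying the same function to the full 1269 bp sequence gives a fraction that can be compared with the GC content field of the record.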
 
Protein sequence
MSIAKRGRTY HLRRRVPRRY RRVEPRGTVW ISLHTDSETV ARSKADRAWS QMIEAWEARL 
AGNSADAEAR HEAARDLART RGFGYLDAGA VAKLPVEDVV ERVEAIPVPA KQPDPVEAAT
LLGTVPKPRT TVTKALELYW TLAREKTFGK SEDQMRRWEA PRNKAIKNFV SVVGDKEIAN
ITRDDMLDFR QHWLDRIEAG EVTANSANKD LIHLGDVLKT VNTMKRLGLD LPLGELSFKQ
GEARTRPPFS NDWITTRLLT PGALDGLNNE ARGLLLGMVN TGYRPSEGAA LTVDTIRLDC
DVPHISIEPD GRQLKSHHAR RVIPLTGVSL KAFEQFPEGF PRYRNRATLS AVVNKFLRTN
GLLETPRHSF YSLRHSFEDR MLAAGIDDRI RRDLFGHRLD RERYGKGASL EHVAELVARI
AF
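
The protein sequence corresponds to a translation of the gene sequence. The following is a rough sketch of that translation, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not handle). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; '*' marks stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene, then its last 9 bases.
print(translate("ATGAGCATTG CCAAACGAGG TCGCACATAC"))  # MSIAKRGRTY
print(translate("GCCTTCTGA"))                         # AF
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence listed above.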