Gene Jann_2067 details

Gene Information

Locus tag: Jann_2067
Symbol:
ID: 3934520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 2076061
End bp: 2077296
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 61%
IMG OID: 637904423
Product: radical SAM family protein
Protein accession: YP_510009
Protein GI: 89054558
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
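
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, although both are consistent with its numbers. The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(2076061, 2077296))  # (1236, 411)
```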


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.408571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.630296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAGT TGAACATGAC TGAAAAACTC GCGATCCTCA GCGATGCCGC GAAATATGAC 
GCGTCGTGTG CGTCCTCGGG ATCGACCCGG CGGGACTCGC GCGATGGCAA AAGCCTGGGC
TCGAATGAGG GCAGCGGTAT TTGCCACGCC TACGCGCCAG ACGGGCGCTG CATCAGCCTG
CTGAAGATCC TGATGACGAA TTTCTGCATT TATGACTGCG CCTATTGCAT CAACCGCGTG
TCCTCCAACG TCGCCCGCGC GCGGTTTTCG GTGGATGAGG TGGTGAAACT GACGATCGAG
TTCTACCGCC GCAACTACAT CGAAGGTCTG TTTCTGTCGT CCGGCGTCAT CAAGTCGCCC
GACGCGACGA TGGAAGCGAT GGTGCAGATT GCCCGCCGCC TGCGTCATGA AGAGAATTTT
CGAGGGTATA TTCACCTGAA GACCATACCG GATGCGGCCC CTGACCTGAT CGCGGAGGCG
GGGCTGCTGG CCGACCGCCT CTCCATCAAT GTGGAGCTGC CCACCGATGC CGCCGTCACC
CAATACGCGC CCGAAAAGAA GCCGGAACAG ATCCGCAAAG CCATGGCGGA TGTGCGTCTG
CGCAAGCAGG GGGCGGCGGA CAAATCGCAC ACCGGCAAGC GGCCCCCGCG TTTCGCGCCT
GCGGGGCAAT CGACGCAAAT GATCATCGGC GCGGACGGGT CGAACGACGC GACGGTTCTG
GGCCAATCCA CACGGCTCTA CTCCAGCTAC AAGCTGAAGC GCGTGTATTA TTCTGCGTTC
TCACCCATTC CTGACAGCTC TGCCAAACTG CCATTGGTGC GCCCGCCGTT GCAGCGCGAA
CACCGGCTCT ATCAGGCGGA TTGGTTGCTT CGATTCTACG GGTTTGATCT GGATGAGATC
ACGGCGGTGA CGCCGGACGG CAATCTGGAT CTACAGATTG ACCCGAAAAT GGCATGGGCT
CTGGCCCATC GCGGGGTGTT TCCCCTGGAC GTCAACACGG CCCCGCAGGA GATGTTGCTG
CGGGTTCCGG GCTTCGGCGT GAAGACCGTC AAGCGCATCC TGTCCACTCG CCGCCATCGC
ACGCTGCGAT ACGATGATAT CACGCGGATG GGGGCCTCGA TGAAAAAGGC GCGGGCGTTT
GTGACAGCGG GTGGGTGGAG CCCCGGCGCG CTGACCGACA GCGTGAATCT GCGGGCACGG
TTCGCGCCCC CGCCCGAGCA GTTGACGCTT CTATGA
 
Protein sequence
MTKLNMTEKL AILSDAAKYD ASCASSGSTR RDSRDGKSLG SNEGSGICHA YAPDGRCISL 
LKILMTNFCI YDCAYCINRV SSNVARARFS VDEVVKLTIE FYRRNYIEGL FLSSGVIKSP
DATMEAMVQI ARRLRHEENF RGYIHLKTIP DAAPDLIAEA GLLADRLSIN VELPTDAAVT
QYAPEKKPEQ IRKAMADVRL RKQGAADKSH TGKRPPRFAP AGQSTQMIIG ADGSNDATVL
GQSTRLYSSY KLKRVYYSAF SPIPDSSAKL PLVRPPLQRE HRLYQADWLL RFYGFDLDEI
TAVTPDGNLD LQIDPKMAWA LAHRGVFPLD VNTAPQEMLL RVPGFGVKTV KRILSTRRHR
TLRYDDITRM GASMKKARAF VTAGGWSPGA LTDSVNLRAR FAPPPEQLTL L
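
The protein sequence can be cross-checked against the gene sequence by translating it in frame. The following is a minimal sketch that uses only the Python standard library. It builds the codon table from the standard amino acid assignments, which translation table 11 shares for all codons except initiation; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position, paired with
# the standard amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACAAAGT TGAACATGAC TGAAAAACTC"))  # MTKLNMTEKL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, with its spacing removed, is a quick consistency check; `gc_fraction` can be compared with the GC content listed in the record in the same way.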