Gene Jann_1555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1555 
Symbol 
ID3934003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1525765 
End bp1526922 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content54% 
IMG OID637903906 
ProductAraC family transcriptional regulator 
Protein accessionYP_509497 
Protein GI89054046 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.294968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.931428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTGG ATATCGCCAT CAGCAGCGTC ACGGCAGGCA TTTGCCTGTT TTGCGCGCAT 
CTCCTGCTGC TCAGGCGGCG AGACACTGGG GTTTATCTGC CCCTTGCCTT GTTATTCCTG
TTCCAGGGGA TCTCTACCGG CGTTGCCGCA CTGGCCGGAA CATATGATCC GGATAGTTTG
GGCATCCTTT TCCGCATCAG CATCATTGTC GGTGGCCTGG AAATCACCCT TCCGTTTCTT
CTTTGGGTCT ATGTGCGGGC GCTGACAACA GAAGGCCAAA CGGAACGTAT CCCGAAATTG
CCGTATCACG TGATCCCGAT TGTTCTGGTT GTCCTCGCAT TCTGGTCGCT CTTGTTTCTT
CCAGACGGAT TTGCAGACAC CGAATTGGAA GATGATGACC CGCGTTTGTT GGGATTTGTC
GCTATCGCGC TGGCCGTTAT GCTTGCGGAT ATTGCGTTCA AAGCGATGGT AGCCACTTAC
ATCTACCTGA TCATCCGCCG CCTCATGGCC TATCGCACGC GTCTAAAGGA TGTGTTCGCC
AGCACCGAAA ACCGAGAACT AACTTGGATA TGGGTGATCT TGATTTGCAT GGCGGTCTAC
CTCAGCGTGA GTATCGCCTT TACCGCGTCG ATTGTGTCCG GTGTTTTTGC CGAAGAAACC
CAAGAAACGT GGTTGCCGAC GCTGAACGGT ATCGCGCTTC TTGGATTGTT CTGGGCCCTT
GGCGTCTGGG GGTTGCGGCA GCGTCCCGGC CTGACGCGGC AGCCCGTCGT CGCCGCCCCG
GAGCCCGATG ATCCCAAGCC GCGAAAATAT GAGAAATCCG CGCTTGATGA CGAACGGCTG
CAACGCATTG CCCGGAAGGT TGAGGCGGCG ATGGCCGAAG ACACCCTCTA CCGTGATCCC
AACTTATCAC TTTGGGATCT GGCAAAGCAC ATTGGCGTCA CGTCTCACTA TGTGTCTCAA
GCGCTGAACA CCCATCTGAA CAAGAGTTTC TTTGACCTGG TGAATGGATG GCGGATCAAG
GATGCCATCG AACAGTTGAC CACGACAGAT GAGACCATCT TGACGATTGC CTATGACGTC
GGCTTCAACT CCCGCTCCGC ATTTTATAAA GCGTTCAAAC GCGAAACAGG GCGAACCCCT
TCTGACCTGA GAAACTAG
 
Protein sequence
MTLDIAISSV TAGICLFCAH LLLLRRRDTG VYLPLALLFL FQGISTGVAA LAGTYDPDSL 
GILFRISIIV GGLEITLPFL LWVYVRALTT EGQTERIPKL PYHVIPIVLV VLAFWSLLFL
PDGFADTELE DDDPRLLGFV AIALAVMLAD IAFKAMVATY IYLIIRRLMA YRTRLKDVFA
STENRELTWI WVILICMAVY LSVSIAFTAS IVSGVFAEET QETWLPTLNG IALLGLFWAL
GVWGLRQRPG LTRQPVVAAP EPDDPKPRKY EKSALDDERL QRIARKVEAA MAEDTLYRDP
NLSLWDLAKH IGVTSHYVSQ ALNTHLNKSF FDLVNGWRIK DAIEQLTTTD ETILTIAYDV
GFNSRSAFYK AFKRETGRTP SDLRN