Gene Jann_1854 details


Gene Information

Locus tag: Jann_1854
Symbol:
ID: 3934305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1841576
End bp: 1842769
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 64%
IMG OID: 637904208
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_509796
Protein GI: 89054345
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
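
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1841576, 1842769)
print(length)                  # 1194
print(protein_length(length))  # 397
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed above.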


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.58476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00243487
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAAGATA CCGCTCGCAA ACACTGGTCC AAACGCACCC GCGCGGTGCA TGCAGGCTCC 
CGCCGCAGCC AATACGGAGA GCTGTCGGAA GCGATGTTCC TGACGCAGGG CTTCGTCTAT
CCCACCGCCG AGGATGCGGA GGCCCGTTTC ATCAAATCCG GCGAGGATGA GTATATCTAC
GCCCGCTACG GCAACCCGAC CGTGGCGATG TTTGAAGACC GCATCGCGTC GCTGGAAGGG
GCGGAGGCGG GCTTTGCCAC GGCCTCGGGC ATGGCGGCAG TCAATGGCGC GCTCACGTCG
ATGCTGCGGG CGGGCGATCA CGTGGTGTCG TCCCGCGCGC TTTTTGGGTC GTGTCATTAT
GTCTTGGACG AGATCCTGAC CCGGTTTGGC GTGGACGTCA CCTTCGTGGA CGGCCCCGAT
CTGGACGCGT GGCGCGCCGC CATGCGCCCG GACACCAAGG CGGTGTTCTT CGAATCGCTC
TCCAATCCCA CGCTGGAGAT GATCGACATT CGCGCCGTGG CTGAGATCGC CCATGCCGTC
GGTGCGACGG TCATCTGCGA TAACGTCTTT GCCACCCCCA CGTTCAGCGA TGCCATCGCC
CAAGGCGTCG ATGTCGTTGT CTATTCCACC ACCAAACACA TTGACGGGCA GGGGCGCTGT
CTGGGGGGCG TGATCCTGGG GACGGAAGAA TTTATCCGCA AAACGGTGGA GCCTTACCTC
AAGCACACCG GCGGCGCGAT GTCGCCCTTC AACGCGTGGG TGATGCTGAA GGGGCTGGAG
ACGATGGACC TGCGGGTGCG GGCGCAAACT GCGTCGGCTC AGGCGATTGC GGAGGCGCTG
CAAGATGCGC CCGGTGTGGC GCGGGTGATT TATCCCGGCC TCGCCGACCA CCCCCAGCAC
GCGCTCTGCC AGGCGCAGAT GGGCGAGGGG GGGACCGTCG TTGCGGTGGA GGCCACGGAT
GGACAGGCGG GGGCGTTCCG CGCGCTCAAT GCGCTGGAGA TCTTCACGAT TTCCAACAAT
CTTGGCGATG CGAAGTCCAT TGCCACCCAT CCCACGACGA CCACCCACCA GCGCCTGACC
GATGAGCAGC GCGCGGAGAT GGGGATCACG CCGGGCCTGA TCCGTCTGTC GATCGGCTTG
GAAGACACCG ATGATCTGGT CGCAGACCTG CTTGATGCGT TGGAACTGGC ATGA
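
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of that step plus a GC-fraction calculation, shown on the first line of the sequence only. `clean_sequence` and `gc_fraction` are hypothetical helper names, and how the GC content field of this record was rounded is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAGATA CCGCTCGCAA ACACTGGTCC AAACGCACCC GCGCGGTGCA TGCAGGCTCC"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the full 1194-base sequence would give a value that can be compared with the GC content field.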
 
Protein sequence
MKDTARKHWS KRTRAVHAGS RRSQYGELSE AMFLTQGFVY PTAEDAEARF IKSGEDEYIY 
ARYGNPTVAM FEDRIASLEG AEAGFATASG MAAVNGALTS MLRAGDHVVS SRALFGSCHY
VLDEILTRFG VDVTFVDGPD LDAWRAAMRP DTKAVFFESL SNPTLEMIDI RAVAEIAHAV
GATVICDNVF ATPTFSDAIA QGVDVVVYST TKHIDGQGRC LGGVILGTEE FIRKTVEPYL
KHTGGAMSPF NAWVMLKGLE TMDLRVRAQT ASAQAIAEAL QDAPGVARVI YPGLADHPQH
ALCQAQMGEG GTVVAVEATD GQAGAFRALN ALEIFTISNN LGDAKSIATH PTTTTHQRLT
DEQRAEMGIT PGLIRLSIGL EDTDDLVADL LDALELA
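
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard genetic code, which makes the same codon-to-amino-acid assignments as translation table 11 (the two tables differ in which start codons they permit). `translate` is a hypothetical helper, it stops at the first stop codon, and it raises `KeyError` on any character other than A, C, G, or T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAGATA CCGCTCGCAA ACACTGGTCC AAACGCACCC GCGCGGTGCA TGCAGGCTCC"
last_line = "GAAGACACCG ATGATCTGGT CGCAGACCTG CTTGATGCGT TGGAACTGGC ATGA"

print(translate(first_line))  # MKDTARKHWSKRTRAVHAGS
print(translate(last_line))   # EDTDDLVADLLDALELA
```

Both outputs match the corresponding ends of the protein sequence above. The last line can be translated on its own because it begins at base 1141, which falls on a codon boundary.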