Gene Jann_1993 details


Gene Information

Locus tag: Jann_1993
Symbol:
ID: 3934445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1992420
End bp: 1993445
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 64%
IMG OID: 637904348
Product: cysteine synthase A
Protein accession: YP_509935
Protein GI: 89054484
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
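
The start, end, gene length and protein length listed above are mutually consistent, which a short check makes explicit. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one stop codon; both are assumptions that happen to reproduce the listed values, not conventions documented on this page. The function names are illustrative.

```python
# Values copied from the Gene Information fields.
START_BP = 1992420
END_BP = 1993445
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```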


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.535285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCC ACTCCGACCT CGCCTCTGCC ATCGGGAACA CGCCCCTCAT TCGCCTGCGT 
GCGGCGTCGG AGGCGACGGG ATGCGAGATC CTGGGCAAGG CCGAATTCCT CAACCCCGGC
CAATCGGTGA AGGACCGCGC CGCGCTCTTC ATCATCCGTG ATGCGGTCGC GCGCGGGGAT
CTGCGCCCCG GCGGCACCAT CGTGGAGGGC ACGGCGGGCA ACACCGGTAT CGGTCTTGCG
CTGGTGGGGG CCTCTCTGGG GTTCAAGACG GTGATCGTCA TCCCCGACAC GCAATCCCAG
GAAAAGAAGG ACATGATCCG CATCGCGGGC GCGGAGTTGA TCGAAGTGCC TGCGGTTCCC
TATTCGAACC CCAACAACTA CGTGAAATAC TCTGGCCGCT TGGCCGAACG TCTGGCGCAA
TCCGATCCCA ACGGTGCGAT CTGGGCCAAC CAGTTCGACA ATGTCGCCAA CCGTCAGGCC
CATATCGACA TGACGGGGCC CGAGATCTGG GAGCAGACGG AGGGCCGCGT GAACGGGTTC
ACCTGTGCCG TGGGCTCTGG CGGAACGTTG GCGGGCGTCG GCATGGCCTT GCAGCCCAAG
GGTGTAAAGG TGGCGCTGTC TGATCCCGAA GGGTCCGGCC TCTACAAGTT ATATACCGGG
CAGGAGGCGG GCGGGAATTC CATCACTGAA GGGATAGGGC AGGGCCGCAT CACCGCTAAT
CTTGAAGGGT TCACGCCCGA TGCCTGCTAT AAGATCCCCG ACGCAGAGGC CCTGCCTATC
GTCTACGATA TCCTCGCCGA CGAGGGGCTC TGTCTGGGGG GGTCCTCCGG GATCAACGTG
GCGGGCGCGA TCCGCATGGC GCGTGAGATG GGGCCGGGCC ATACCATCGT GACGATCCTG
TGTGACTACG GGACGCGGTA TCAATCCAAG CTGTTTAACC CCGACTTCCT GCGCGAAAAG
GGCCTGCCCG TGCCGCAATG GATGGACGCA AAGGGGCGCG ACGTCCCGCA GGTCTTCGCG
GAATGA
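
The gene sequence is printed in blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first sequence line is used here as sample input, so the printed value is not the gene-wide GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (sample input only).
FIRST_LINE = "ATGACGATCC ACTCCGACCT CGCCTCTGCC ATCGGGAACA CGCCCCTCAT TCGCCTGCGT"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq), round(gc_fraction(seq) * 100, 1))
```

Passing the full 1026 bp sequence through the same two functions gives a figure that can be compared against the GC content field.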
 
Protein sequence
MTIHSDLASA IGNTPLIRLR AASEATGCEI LGKAEFLNPG QSVKDRAALF IIRDAVARGD 
LRPGGTIVEG TAGNTGIGLA LVGASLGFKT VIVIPDTQSQ EKKDMIRIAG AELIEVPAVP
YSNPNNYVKY SGRLAERLAQ SDPNGAIWAN QFDNVANRQA HIDMTGPEIW EQTEGRVNGF
TCAVGSGGTL AGVGMALQPK GVKVALSDPE GSGLYKLYTG QEAGGNSITE GIGQGRITAN
LEGFTPDACY KIPDAEALPI VYDILADEGL CLGGSSGINV AGAIRMAREM GPGHTIVTIL
CDYGTRYQSK LFNPDFLREK GLPVPQWMDA KGRDVPQVFA E
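
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon map from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons it permits, which does not matter for a gene that starts with ATG. The helper name `translate` is illustrative, and the example input is only the first line of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as printed.
FIRST_LINE = "ATGACGATCC ACTCCGACCT CGCCTCTGCC ATCGGGAACA CGCCCCTCAT TCGCCTGCGT"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MTIHSDLASAIGNTPLIRLR
```

The output matches the first twenty residues of the protein sequence above.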