Gene Jann_3589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3589 
Symbol 
ID3936064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3666152 
End bp3667381 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content62% 
IMG OID637905964 
Producttryptophan synthase subunit beta 
Protein accessionYP_511531 
Protein GI89056080 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.738122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACG ATCTATTGAA CAGCTTCATG ACGGGACCCG ATGAAAAGGG CCGGTTCGGT 
GATTTCGGCG GGCGGTTTGT GTCGGAGACG CTGATGCCGC TGATCCTTGA GCTGGAGCAG
CAATATGAGC ACGCAAAGAC CGACGACAGC TTCTGGGCCG AGATGCAGGA CCTCTGGACC
CATTATGTGG GTCGCCCCTC TCCGCTCTAT TTCGCGAAAC GTCTGACCGA GCGTCTGGGC
GGCGCGAAGA TCTACCTCAA ACGCGATGAG CTGAACCACA CGGGCGCCCA CAAGATCAAC
AACGTGTTGG GCCAGATCAT CCTGGCGCGG CGCATGGGCA AGACCCGCAT TATTGCTGAG
ACAGGAGCGG GCCAGCACGG CGTGGCCACC GCCACGGTCT GCGCGAAGTT TGGCCTCAAA
TGCATCGTCT ACATGGGCGC GACGGATGTG GAGCGTCAGG CCCCCAACGT GTTCCGCATG
AAGCTTCTGG GCGCGGAGGT CGTACCGGTG ACCTCCGGGC GCGGCACACT CAAGGACGCG
ATGAACGACG CCCTGCGCGA CTGGGTCACC AACGTGCGCG ATACCTTCTA CTGCATCGGC
ACGGTCGCAG GTCCCCACCC CTACCCCGCC ATGGTCCGCG ATTTTCAGGC GATCATCGGC
AAGGAGACAC GCGACCAGAT GATGGCCGCA GAAGGCCGCC TGCCCGATAC GCTGATCGCC
GCCATCGGCG GTGGCTCCAA CGCCATGGGC CTGTTCTTCC CATTCCTGGA CGACAAGAGT
GTGAACATCA TCGGGGTGGA GGCCGGCGGC CACGGCGTCA ATGAGAAGAT GGAACATTGT
GCATCCCTGA CCGGGGGTCG CCCCGGCGTG CTGCACGGCA ATCGGACGTA TCTGTTGCAG
GATGAGGATG GACAGATCCT TGAAGGGCAT TCGATCTCGG CCGGTCTGGA TTATCCCGGG
ATCGGACCGG AGCACGCCTG GCTGCACGAG ATCGGGCGCG CGCAATATGT CTCCATCACG
GATCGGGAAG CGTTGGACGC CTTCCAGCTG TCATGCGAGA CTGAGGGTAT CATCCCGGCG
CTGGAACCGT CCCACGCGCT GGCCCACGTG TGCAAGATTG CGCCCGATAT GCCCCGCGAT
CATCTGCTGG TGATGAACAT GTGCGGACGC GGCGACAAGG ATATCTTCAC CGTCGCCCGG
GCCCTCGGCT GGGATATGGA CGGGGCCTGA
 
Protein sequence
MPDDLLNSFM TGPDEKGRFG DFGGRFVSET LMPLILELEQ QYEHAKTDDS FWAEMQDLWT 
HYVGRPSPLY FAKRLTERLG GAKIYLKRDE LNHTGAHKIN NVLGQIILAR RMGKTRIIAE
TGAGQHGVAT ATVCAKFGLK CIVYMGATDV ERQAPNVFRM KLLGAEVVPV TSGRGTLKDA
MNDALRDWVT NVRDTFYCIG TVAGPHPYPA MVRDFQAIIG KETRDQMMAA EGRLPDTLIA
AIGGGSNAMG LFFPFLDDKS VNIIGVEAGG HGVNEKMEHC ASLTGGRPGV LHGNRTYLLQ
DEDGQILEGH SISAGLDYPG IGPEHAWLHE IGRAQYVSIT DREALDAFQL SCETEGIIPA
LEPSHALAHV CKIAPDMPRD HLLVMNMCGR GDKDIFTVAR ALGWDMDGA