Gene Sde_2091 details

Gene Information

Locus tag: Sde_2091
Symbol:
ID: 3967475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2675313
End bp: 2676428
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 49%
IMG OID: 637921181
Product: chorismate synthase
Protein accession: YP_527563
Protein GI: 90021736
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
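
The coordinate and length fields above can be cross-checked with simple arithmetic. The following Python sketch is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2675313
end_bp = 2676428
gene_length_bp = 1116
protein_length_aa = 371

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last one being a stop.
codon_count, leftover = divmod(span_bp, 3)
expected_protein_aa = codon_count - 1  # stop codon is not translated

print("span (bp):", span_bp)
print("leftover bases:", leftover)
print("expected protein length (aa):", expected_protein_aa)
```

Under those assumptions the span equals the listed gene length, and the codon count minus one equals the listed protein length.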


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGAA ACACCTTTGG TAAATTATTC ACCGTAACCA CATTTGGCGA AAGCCACGGT 
TTGGCGCTAG GCGCCATAAT TGATGGTTGC CCACCGGGAA TTGAATTGTC TGAAGAAGAT
TTACAGTTAG ATTTGGATCG ACGCAAACCG GGTACCTCGC GCTATACAAC ACAGCGTAAA
GAGGCCGATC AGGTAAAGAT TTTATCTGGT GTATTCGAGG GCAAAACAAC GGGTACGCCA
ATAGGCTTGC TAATCGAAAA CACGGATCAG CGCTCGAAAG ACTACGGCAA AATTAAAGAC
CAATTTCGCC CAGCTCATGC CGATTACACT TATATGCAAA AGTACGGCAT TAGGGATTAC
AGAGGCGGCG GTCGATCATC AGCTCGCGAA ACCGCAATGC GGGTTGCAGC GGGTGCCGTT
GCCAAAAAGG TACTTGCCAA CCTGTGGGGT ATAAAAATTC GCGGGTATTT GTCGCAACTG
GGGCCAATTA AAGCTGAGTT GTTAGATTGG AACGAAGTTG AGCAAAACCC GTTTTTCTGC
CCCGATAAGT CGAAAGTTCC CGAAATGGAG GCTTATATGC AGGCGCTAAA TAAAGAGGGT
AACTCGGTTG GTGCCAAAAT TACCGTCGTT GCCGAAAACA TGATTCCTGG TTTGGGAGAG
CCTGTTTTCG ATCGTATTGA TGCAGATTTG GCCCACGCGC TAATGGGTAT TAACGCGGTT
AAAGGTGTTG AAATAGGTGC AGGCTTTGCT TGTGTTGCTC AAAAAGGCAC AGAGCATCGC
GACGAAATAA CCCCAGAAGG GTTTAAGTCG AATCAAGCCG GTGGGGTGCT TGGCGGTATT
TCTACCGGGC AGGATTTAAT TGCGTCTTTA GCGCTTAAGC CTACCTCTAG CTTACGGATT
CCTGGCCAAA GTGTCGATAT AGAAGGTAAC CCTGTTGAGG TAATTACTAC TGGCCGCCAC
GACCCGTGTG TGGGTATTCG AGCAACGCCA ATAGCTGAAG CAATGATGGC GTTAGTCATT
CTCGATCACG CCCTTCGCAA CCGAGGTCAA AACGGTCACG TTCAATCGGG TGTGCCTATT
ATTCCTGGGA GCATTCCCGG CCAAATAGGT AGCTAG
 
Protein sequence
MSGNTFGKLF TVTTFGESHG LALGAIIDGC PPGIELSEED LQLDLDRRKP GTSRYTTQRK 
EADQVKILSG VFEGKTTGTP IGLLIENTDQ RSKDYGKIKD QFRPAHADYT YMQKYGIRDY
RGGGRSSARE TAMRVAAGAV AKKVLANLWG IKIRGYLSQL GPIKAELLDW NEVEQNPFFC
PDKSKVPEME AYMQALNKEG NSVGAKITVV AENMIPGLGE PVFDRIDADL AHALMGINAV
KGVEIGAGFA CVAQKGTEHR DEITPEGFKS NQAGGVLGGI STGQDLIASL ALKPTSSLRI
PGQSVDIEGN PVEVITTGRH DPCVGIRATP IAEAMMALVI LDHALRNRGQ NGHVQSGVPI
IPGSIPGQIG S
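
The gene and protein sequences above can be related by translating the gene in frame. The following is a minimal Python sketch, not the method used by the database. It builds the standard codon assignments, which translation table 11 shares for elongation codons, and translates only the first and last lines of the gene sequence as a spot check. The helper name `translate` and the variables `first_line` and `last_line` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
# Each full line holds 60 bases, so the last line also starts in frame.
first_line = "ATGTCTGGAA ACACCTTTGG TAAATTATTC ACCGTAACCA CATTTGGCGA AAGCCACGGT"
last_line = "ATTCCTGGGA GCATTCCCGG CCAAATAGGT AGCTAG"

print(translate(first_line))  # MSGNTFGKLFTVTTFGESHG
print(translate(last_line))   # IPGSIPGQIGS*
```

The two outputs correspond to the first 20 and last 11 residues of the protein sequence, followed by the stop codon TAG.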