Gene Franean1_1933 details

Gene Information

Locus tag: Franean1_1933
Symbol: trpA
ID: 5670334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2316180
End bp: 2317100
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 78%
IMG OID: 641240854
Product: tryptophan synthase subunit alpha
Protein accession: YP_001506276
Protein GI: 158313768
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0159] Tryptophan synthase alpha chain
TIGRFAM ID: [TIGR00262] tryptophan synthase, alpha subunit
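
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
# Values copied from the record; the coordinate convention (1-based,
# inclusive) is an assumption, not something the record states.
START_BP = 2316180
END_BP = 2317100


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end positions."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Under these assumptions the results agree with the 921 bp gene length and 306 aa protein length given in the record.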


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0945719
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.868829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCACG ACCAGTCGGC CCAGCAGGCA TGGGCCCAGG AGCAGCCGGC GCGCGACCAG 
CCGGCGCCGA GCCGGCTCGA GCCGTCCGGG CAGCGGCCGG CCCGGCGCGG CGGGCCGAGC
CCGCTGGACG AGGCCTTCGC GGCCGCGCGC AAGGACGGGC GGGCGGTGCT CGTCGGCTAT
CTCCCCGCCG GGTTCCCGAC GGTGGACCGC GGCATCGCGG CGATGCGGGC GATGGTCGCG
GCGGGCGTGG ACGTCGTCGA GGTCGGCCTG CCCTACTCGG ATCCGACGAT GGACGGCCCG
GTCATCCAGG ACGCCGCGGA CACCGCGCTG CGCGGCGGCG TGACCACCAG GGACGTGCTG
CGCACGGTCG AGGCGGTCGC CGAGACCGGG GCCCCCACCC TGGTGATGAC CTACTGGAAC
CCGGTGGAGC GGTACGGCAT GGAGGCGTTC GCCGCCGACC TGGCCGCCGC CGGCGGGGCC
GGGGCGATCA CCCCCGACCT GCCGCCGGAG GAGGCCGGCC CGTGGCTCGC GGCCAGCGCC
ACCCACGGCC TCGACCCGGT CTTCCTGGTC GCGCCGAGCT CGACCACCGA ACGGCTGCGC
CTGGTGACGG CGCACAGCGG CGGCTTCGTC TACGCGGCGT CGACCATGGG CGTCACCGGT
GCGCGCGCCG CCGTCGGTGT GAAGGCGGCC GGCCTGGTCG CCCGGGTCCG GGAGGTGACC
GACCTGCCTG TGGCGGTTGG CCTCGGCGTC AGCACCGGTG CTCAGGCGTC CGAGGTGGCC
GGCTTCGCCG ACGGCGTCAT CGTGGGCTCG GCGCTGGTCC GGGCCCTGGC GGCCGACGCG
CGGGACGGCG CCGACGGCGT CGGTGCGATC GAGCGGCTGG CGGCTGAGCT CGCCGCCGGC
GTGCGTTCGG CCACCGCCTG A
 
Protein sequence
MAHDQSAQQA WAQEQPARDQ PAPSRLEPSG QRPARRGGPS PLDEAFAAAR KDGRAVLVGY 
LPAGFPTVDR GIAAMRAMVA AGVDVVEVGL PYSDPTMDGP VIQDAADTAL RGGVTTRDVL
RTVEAVAETG APTLVMTYWN PVERYGMEAF AADLAAAGGA GAITPDLPPE EAGPWLAASA
THGLDPVFLV APSSTTERLR LVTAHSGGFV YAASTMGVTG ARAAVGVKAA GLVARVREVT
DLPVAVGLGV STGAQASEVA GFADGVIVGS ALVRALAADA RDGADGVGAI ERLAAELAAG
VRSATA
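
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. This follows from translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates that relationship in plain Python. It is not the tool that produced this record: the helper name translate is hypothetical, and the codon table and start-codon set are written out from the NCBI definition of table 11 as an assumption of this example.

```python
# Translation table 11 shares the standard amino acid assignments; the
# codons are enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence listed in this record.
first_line = (
    "GTGGCGCACG ACCAGTCGGC CCAGCAGGCA TGGGCCCAGG AGCAGCCGGC GCGCGACCAG"
)
print(translate(first_line))  # MAHDQSAQQAWAQEQPARDQ
```

The output matches the first 20 residues of the protein sequence in the record. Note that GTG is read as M only at the first position; elsewhere in the gene it is translated as V.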