Gene Francci3_3016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3016 
Symbol 
ID3904369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3582171 
End bp3583508 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content71% 
IMG OID637880336 
Producttryptophan synthase subunit beta 
Protein accessionYP_482102 
Protein GI86741702 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0445574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCC GAACCGCTGC CGAGTCAGTT CGCCCCGCGG CCGCCGGAAC CGGGACGCCG 
TCGGGCTCCG GTCCGGTCCG TCCGGACACG TCGAGCTGGC AGGCGTTCCC CGACTCTTCG
GGGCACTTCG GGCGTTTCGG TGGTCGTTTC GTCCCCGAGG CCCTCATGGC GGCGCTGGAC
GAGCTCACCG CAGCCTACGC GGCGGCCAGG GTGGATCCCG GCTTCACCGC GGAACTGGAC
GGGCTGCTGG CGAGCTACGC GGGCCGGCCC ACTCCGCTCA CCGACGCCCA CCGGCTCACC
GCGCACGTTG GCGGTGCCCG CATCCTGCTC AAGCGTGAGG ATCTTGCGCA CACCGGTTCC
CACAAGATCA ATAATGTCCT CGGGCAGTGC CTGCTGACCC GGCGGATGGG CAAGACCCGG
GTTATCGCCG AGACCGGAGC CGGTCAGCAC GGTGTTGCCA CGGCCACCGC CTGCGCCCTG
CTCGGCCTGG ACTGCGTCGT CTACATGGGC GAGGAGGACA CCCGCCGGCA GGCCCTAAAC
GTCGCCCGGA TGCGCCTGCT CGGCGCCGAG GTCGTTGCGG TCACCTCCGG CAGCCGAACC
TTGAAGGACG CCATCAACGA GGCCATGCGC GACTGGGTGG CCACCGTTGA CACGACCCAT
TACTGCATCG GGTCGGTGAT GGGCCCGCAT CCGTTCCCGA TGCTGGTCCG CGACTTCCAG
CGGATCATCG GGGTCGAGGC GCGCGCCCAG GTGCTCGATC GTGTCGGCCG ACTGCCCGAC
GCCGTCGTCG CCTGCGTCGG TGGCGGCTCG AACGCGATGG GCATCTTCCA CCCGTTCATC
CCCGACGTCG AGGTCGCGCT GATCGGCTGC GAGGCCGGGG GGGACGGGGT CGCCACCGGT
CGGCACGCCG CGGCGATCGC CGGCGGTTCC GCGGGCGTGC TGCACGGCAT GCGGACGTTC
CTGCTGCAGG ACGAGTTCGG GCAGACCCAG GTCTCCCACT CGATCTCGGC GGGTCTGGAC
TACCCTGGCA TCGGCCCCGA GCATGCCCTG CTGCACGAGA CCGGTCGGGC GAGCTATCGG
GTCGTGGACG ACACCGCCGC GATGGAGGCG CTCGCGCTGG TGGCCCGTAC CGAGGGCATC
CTGGTCGCGA TCGAGAGCGC GCACGCCTTC GCGGGTGCCC TCGACGTCGC CCGGAAGCTC
GGTCCGGGCA CGACTGTCGT CGTCAACTGC TCCGGTCGGG GGGACAAGGA CGTCGACACC
GCCGCCCGCT GGTTCGACCT CCTCGACGAG GCCGATGCCG GCGGGGGCCC CGTCGGCGGT
CCGGCGCGAC CGGTCTGA
 
Protein sequence
MTTRTAAESV RPAAAGTGTP SGSGPVRPDT SSWQAFPDSS GHFGRFGGRF VPEALMAALD 
ELTAAYAAAR VDPGFTAELD GLLASYAGRP TPLTDAHRLT AHVGGARILL KREDLAHTGS
HKINNVLGQC LLTRRMGKTR VIAETGAGQH GVATATACAL LGLDCVVYMG EEDTRRQALN
VARMRLLGAE VVAVTSGSRT LKDAINEAMR DWVATVDTTH YCIGSVMGPH PFPMLVRDFQ
RIIGVEARAQ VLDRVGRLPD AVVACVGGGS NAMGIFHPFI PDVEVALIGC EAGGDGVATG
RHAAAIAGGS AGVLHGMRTF LLQDEFGQTQ VSHSISAGLD YPGIGPEHAL LHETGRASYR
VVDDTAAMEA LALVARTEGI LVAIESAHAF AGALDVARKL GPGTTVVVNC SGRGDKDVDT
AARWFDLLDE ADAGGGPVGG PARPV