Gene Franean1_4076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4076 
Symbol 
ID5672434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4858159 
End bp4859244 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content76% 
IMG OID641242952 
Productthreonine aldolase 
Protein accessionYP_001508369 
Protein GI158315861 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000453005 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGTCC AGCCCCCGCG AACACGGCCC GCCGGCCTGC CCGTCGACCT GCGCAGCGAC 
ACCGTCACCC GGCCCACGCC CGGCATGCGC CGCGCGATGG CGGAGGCGGA GGTCGGCGAC
GACGTCTACC GCGAGGACCC GACCGTCCGC GAGCTCGAGG AGCATGCCGC CGCGCTGCTC
GGGCACGAGG CCGCGCTGTT CGTGCCGAGC GGCACCATGG GCAACTTCTG TGCCCTGCGC
GCCGGCGCCC CGGTGGGCAC CGAGGTCGTC GCCGACACCG ACGCGCACAT CGTCACCTAC
GAGCTCGGCG GGCTCGCCGC CCTCGGCGGC GTGCAGACCC GGACCCTCAG CGGCCTCGCG
GACACGCTCG ACCCGGCCGA CATCGCGGCC CAGCTGCGCG CGTTCCCGGT CGCGCACAAC
TACAACATGG TCCGGACGAG CGTGCTCGCG GTGGAGAACA CCCGGGCCCG GGCCGGTGGC
CGGGTGTGGC CCCTGGAGCG GCTCGACCGG CTGCGGGTGA TCACCGAGGC CGCCGGGGTG
GTCCTGCACT GCGACGGCGC CCGCCTCTGG AACGCGGCGG TGGCGCTCGA CGTCCCGCCG
CGCCGCCTCG GCGAGATCTT CGGGACGCTG TCGGTCTGCC TCTCGAAGGG CCTCGGCGCC
CCGGTCGGTT CCCTTGTCGT CGGCGGCGCC GAGCACGTCG AGCGGGCCCG CGAGTGGCGC
AAGCGGCTCG GCGGTGGGAT GCGCCAGGCC GGGGTGCTCG CCGCCGCCGG CCTGTACGCG
CTGCGGCACC ACCTCGACCG CCTCGCCGAC GACCACCGCC ACGCCGCCGC GCTGGCCGCG
ACCCTCGCCG ACGCGGCACC GCGGCGGGTC CACCCGGAGC GCACCGAGAC GAACATGGTG
CTCGTCGACG TCCCGGACGC GGCCGCCTTC TGCGCGCAGG CGGCGGACGG CGGTGTGCTC
GTCGGCCTGG CCGGTCCGAC GACGGTCCGG ATCGTCACCC ACCTCGACGT CGACGACACC
GCGATCCGCC GGGCCGGGGA CGTCCTCGCC CCGCTGCTGA ACTCCCTGCC ACCAGCCGGT
TCCTGA
 
Protein sequence
MSVQPPRTRP AGLPVDLRSD TVTRPTPGMR RAMAEAEVGD DVYREDPTVR ELEEHAAALL 
GHEAALFVPS GTMGNFCALR AGAPVGTEVV ADTDAHIVTY ELGGLAALGG VQTRTLSGLA
DTLDPADIAA QLRAFPVAHN YNMVRTSVLA VENTRARAGG RVWPLERLDR LRVITEAAGV
VLHCDGARLW NAAVALDVPP RRLGEIFGTL SVCLSKGLGA PVGSLVVGGA EHVERAREWR
KRLGGGMRQA GVLAAAGLYA LRHHLDRLAD DHRHAAALAA TLADAAPRRV HPERTETNMV
LVDVPDAAAF CAQAADGGVL VGLAGPTTVR IVTHLDVDDT AIRRAGDVLA PLLNSLPPAG
S