Gene Franean1_7207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7207 
Symbol 
ID5675508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8799955 
End bp8801055 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content74% 
IMG OID641246044 
ProductGlu/Leu/Phe/Val dehydrogenase dimerisation region 
Protein accessionYP_001511432 
Protein GI158318924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0334] Glutamate dehydrogenase/leucine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0169486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCGG TGGTCGACAC GTCCACGGGA GTCTCGTCGG TGTTCGAGGC CGGCGCGGAG 
CACGAGCAGG TGGTGTTCTG CTCCGATCGG GCCAGTGGCC TGCGCGCCGT GATCGCCATC
TACTCGACCG CGCTGGGGCC GGCGCTGGGA GGCACCCGGT TCCACGCCTA TCCGGACGAG
GCGTCCGCAC TAGCCGACGC CCTCGCGCTC TCCCGCGCGA TGGCCTACAA GGCCGCCTGC
GCGGGCCTGG ACCTCGGCGG CGGCAAGGCC GTCATCCTGG GCGACCCCGC CCGCGACAAG
ACCGAGGCGC TGCTGCGCGC CTACGGGCGC TTCATCGCCT CGCTGGGCGG CCGCTACGTG
ACGGCCTGCG ACGTCGGGAC GTACGTCGAG GACATGGACA CCATCGCCAG GGAGACCCGT
TGGGTCACCG GCCGCTCGCC GGCGCACGGC GGCTCGGGCG ACTCCGGCGT CCTGACCGCG
TACGGCGTCT TCGAGGGGAT GCGCGCCTGC GCCCGGCACC GATGGGGGAC ACCCTCGCTC
GCCGGGCGCC GGGTCGCCGT CAGCGGGGTC GGCAAGGTCG GCCTGCGCCT CGTGGGGCAC
CTGGTGGAGG AGGGGGCGAC CGTTCTGGCC GGGGATACCG ATCCGGGCGC CCTGCGGCGA
CTGGGAGCCC GCCATCCCGA CGTCCAGCTG GTGGCCGACC CCGACGAGCT CCTCCGGGCC
GAGGTCGACA TCTACGCGCC CTGCGCGCTG GGCGGGGTGC TCACCGACGA GGTCGTGCCC
GCGCTGCGGG CGGAGATCAT CTGCGGCGGG GCGAACAACC AGCTGGCCCA CCCGGGCATG
GACAAGGTCC TGGCCGACGC GGGCGTGCTG TACGCGCCCG ACTTCGTGGT CAACGCCGGC
GGACTGATCC AGGTGGCGGA CGAGATCGAG GGGTACTCCC CGGAACGGGC CAGGGCCCGG
GCCGCCCGGA TCTTCGACAC GGCGCTGGAC ATCTTCCGGC TCGCCGAGGC GGAGGGCGCC
ACCCCGGCGG TGGCGGCGGG ACGCTTCGCC GAGCGCCGGA TGACCGACAT CGGCCGGCTG
CGGGGCATCC TGCTGCCCTG A
 
Protein sequence
MSAVVDTSTG VSSVFEAGAE HEQVVFCSDR ASGLRAVIAI YSTALGPALG GTRFHAYPDE 
ASALADALAL SRAMAYKAAC AGLDLGGGKA VILGDPARDK TEALLRAYGR FIASLGGRYV
TACDVGTYVE DMDTIARETR WVTGRSPAHG GSGDSGVLTA YGVFEGMRAC ARHRWGTPSL
AGRRVAVSGV GKVGLRLVGH LVEEGATVLA GDTDPGALRR LGARHPDVQL VADPDELLRA
EVDIYAPCAL GGVLTDEVVP ALRAEIICGG ANNQLAHPGM DKVLADAGVL YAPDFVVNAG
GLIQVADEIE GYSPERARAR AARIFDTALD IFRLAEAEGA TPAVAAGRFA ERRMTDIGRL
RGILLP