Gene Franean1_6099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6099 
Symbol 
ID5674420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7424410 
End bp7425957 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content73% 
IMG OID641244951 
Productradical SAM domain-containing protein 
Protein accessionYP_001510349 
Protein GI158317841 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.61579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0270865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG TGTCGGGCCG GACAACATGC ACTCCGACGT GCACCGTTCC GACTTCGGCG 
GCCAGAACGG CGCCGGGCCT TCCGGCCCGG GAACGACCGG TGCGGGCGGG CCCGGCGCCG
GGGCGGGTTC CGGCTCCCGC GACAGCTCGG GATACGACCG GATCTGAGCC GGTCCGCCGC
CCGGGCCGGC GCCCGCCGGA GCCGGCGCCA CCCGGGCCGG CGCTTCCCCG GGCTGGTGCA
GCCCGGGCCG GGGCTTCTGA GGCGGTGTCT TCCGAGCCGG CTCGTCCGAG TTCGGCCGCC
CGCCGCCCGG GGCTCGCCCG GCGCAGGTCG TACGATCCTG GGGTGAGCCT TGTTGATGCC
GACGGTGAGT CCCCGGCGGG CCGGGCCCGC GTTCCCGACG CCGAGATCCG CGCGCTGCTC
GACCGGGCGG CCGACGGCGG GCGGATCTCG CCGGAGGAGG CGCTCCTCCT CTACACGTCC
GCTCCGCTGC ACGGGCTCGG GCGCGCGGCC GACGCCGTCC GTCGCCGCCG GTACCCGGAC
GGCATCGCCA CCTACATCAT CGACCGGAAC ATCAACTACA CCAACGTCTG CGTGACGGCC
TGCCGGTTCT GCGCCTTCTA CCGGTCGCCG AAGCACGCCG AGGGCTGGGT CCGCGACCTC
GACGACATCG TTGCCAAGTG CGGCGAGGCG GTCGAGCTGG GCGCCACCCA GATCATGCTG
CAGGGCGGCC ACAACCCCGA GTTCGGGATC GAGTGGTACG AGCGGACGTT CGCCGGCATC
AAGGCCGCCT ACCCGCAGCT CGCCCTGCAC TCGCTCGGCG CCAGCGAGGT CGTGCACATC
GCGCGGACGT CCGACCTGGA CTTCGCCGAG GTGATCACCC GGCTGCGCGA CGCCGGGCTC
GACAGCTTCG CCGGCGCGGG CGCGGAGATC CTCACCGAGC GTCCCCGGCA CGCGATCGCC
CCGCTCAAGG AGCCCGGCCA CGTGTGGCTG TCGGTCATGG AGATCGCCCA CGGCCTGGGC
CTGGAGTCGA CGGCGACGTT CATGATGGGC ACCGGCGAGA CCAACGCCGA GCGCATCGAG
CACATGCGCA TGATCCGGGA CGTCCAGGAC CGCACCGGCG GGTTCCGCTC GTTCATCCCG
TGGACGTACC AGCCGGAGAA CAACCACCTC GGCGGCCGGA CGCAGGCGAC GACCCTGGAG
TACCTGCGGC TGGTGGCCGT CGCGCGGCTG TTCTTCGACA ACGTGGCCCA CCTGCAGGGC
TCGTGGCTGA CCACCGGCAA GGAGGTCGGC CAGCTCACCC TGCACATGGG CGCCGACGAT
CTCGGCTCGG TGATGCTCGA GGAGAACGTC GTCTCCTCCG CGGGGGCCCG GCACCGCACC
AACCGGTCCG AGCTCATCCA CCTGATCCGG GCCGCCGACC GCATCCCGGC GCAGCGCGAC
ACGCTGTACC GGCACCTCGT CGTGCACCGT GACCCGGCGC TCGACCCGGT CGACGACCGG
GTGGCCTCGC ACTTCTCCTC GACCGCGCTG CCGCTGGTGT CGACCTGA
 
Protein sequence
MTPVSGRTTC TPTCTVPTSA ARTAPGLPAR ERPVRAGPAP GRVPAPATAR DTTGSEPVRR 
PGRRPPEPAP PGPALPRAGA ARAGASEAVS SEPARPSSAA RRPGLARRRS YDPGVSLVDA
DGESPAGRAR VPDAEIRALL DRAADGGRIS PEEALLLYTS APLHGLGRAA DAVRRRRYPD
GIATYIIDRN INYTNVCVTA CRFCAFYRSP KHAEGWVRDL DDIVAKCGEA VELGATQIML
QGGHNPEFGI EWYERTFAGI KAAYPQLALH SLGASEVVHI ARTSDLDFAE VITRLRDAGL
DSFAGAGAEI LTERPRHAIA PLKEPGHVWL SVMEIAHGLG LESTATFMMG TGETNAERIE
HMRMIRDVQD RTGGFRSFIP WTYQPENNHL GGRTQATTLE YLRLVAVARL FFDNVAHLQG
SWLTTGKEVG QLTLHMGADD LGSVMLEENV VSSAGARHRT NRSELIHLIR AADRIPAQRD
TLYRHLVVHR DPALDPVDDR VASHFSSTAL PLVST