Gene Franean1_2085 details


Gene Information

Locus tag: Franean1_2085
Symbol:
ID: 5670486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2510784
End bp: 2512196
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 66%
IMG OID: 641241007
Product: FeS assembly protein SufB
Protein accession: YP_001506428
Protein GI: 158313920
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component
TIGRFAM ID: [TIGR01980] FeS assembly protein SufB
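
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Franean1_2085.
gene_length = feature_length(2510784, 2512196)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1413 470
```

Under these assumptions the results agree with the listed gene length (1413 bp) and protein length (470 aa).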


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.883646
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCT CTGCCGAGAC CGCCCTTGAG GGCCTTGGCT CCTACCGGTT CGGCTGGGCG 
GATGCGGACG CCTATGCCGT CGACGTGGAA CGTGGCCTCT CCGAGGCGGT CGTACGTAGC
ATCTCGGCGA AGAAGAACGA GCCGTCCTGG ATGACGGACC TGCGGCTGAA GGGCCTGAAG
CTCTTCGAGC GCAAGCCCAT GCCGACCTGG GGCGCGGACC TGTCCGGCAT CCACTTCGAC
AACATCAAGT ACTTCGTCAG GTCGACGGAG AAGCAGGCCG AGGAGTGGGC GGACCTGCCG
GAGGAGATCC GGGCCACCTA CGACCGGCTC GGCATCCCCG AGGCGGAGAA GCAGCGGCTG
ATCTCCGGTG TCGCGGCCCA GTACGAGTCG GAGGTCGTCT ACCACAAGAT CCGTGAGGAC
CTCGAGGAGC AGGGCGTCAT CTTCCTGGAC ACCGACTCGG GGCTGCGCGA GCACCCGGAG
ATCTTCCAGG AGTACTTCGG CTCGGTGATC CCGGTCGGCG ACAACAAGTT CGCCGCGCTG
AACACCTCGG TGTGGTCGGG TGGCTCGTTC ATCTACGTCC CGCCGGGCGT GCAGGTCGAG
ATCCCGCTGC AGGCCTACTT CCGGATCAAC ACCGAGAACA TGGGCCAGTT CGAGCGCACC
CTGATCATCG TGGACGAGGG CGCCTACGTC CACTACGTCG AGGGCTGCAC CGCGCCGGTC
TACTCGTCGG ACTCGCTGCA CTCCGCGGTC GTCGAGATCG TCGTGAAGAA GAACGCCCGC
TGCCGGTACA CGACCATCCA GAACTGGTCG AACAACGTCT ACAACCTGGT CACGAAGCGG
GCCGCCTGCC ACGAGGGCGC CACGATGGAG TGGATCGACG GCAACATCGG CTCCAAGGTG
ACGATGAAGT ACCCGGCGGT GTGGCTGCTC GGCGAGCAGG CCCACGGCGA GGTCCTCTCG
ATCGCCTTCG CAGGTGAGGG CCAGCACCAG GACGCCGGCG CCAAGATGGT GCACGCCGCG
CCGCGCACCT CGTCCAAGAT CGTCTCGAAG TCGGTGGCCC GCGGCGGCGG CCGGACCTCG
TACCGTGGCC TGGTCCAGAT CAACGAGGGC TCGCACGCCT CGCGGTCGAC GGTGAAGTGT
GACGCGCTGC TGGTCGACAC GGTCAGCCGC TCCGACACCT ACCCCTATGT CGACGTGCGC
GAGGACGACG CCTCCATCGG GCACGAGGCC AGCGTCTCCA AGGTCGGCGA GGACCAGCTC
TTCTACCTGA TGAGCCGCGG TCTGTCCGAG GACGAGGCGA TGGCCATGGT GGTGCGCGGC
TTCATCGAGC CGGTCGCCCG CGAGCTGCCC ATGGAGTACG CCCTCGAACT CAACCGGCTC
ATCGAGCTCC AGATGGAAGG CGCAGTCGGC TGA
 
Protein sequence
MTTSAETALE GLGSYRFGWA DADAYAVDVE RGLSEAVVRS ISAKKNEPSW MTDLRLKGLK 
LFERKPMPTW GADLSGIHFD NIKYFVRSTE KQAEEWADLP EEIRATYDRL GIPEAEKQRL
ISGVAAQYES EVVYHKIRED LEEQGVIFLD TDSGLREHPE IFQEYFGSVI PVGDNKFAAL
NTSVWSGGSF IYVPPGVQVE IPLQAYFRIN TENMGQFERT LIIVDEGAYV HYVEGCTAPV
YSSDSLHSAV VEIVVKKNAR CRYTTIQNWS NNVYNLVTKR AACHEGATME WIDGNIGSKV
TMKYPAVWLL GEQAHGEVLS IAFAGEGQHQ DAGAKMVHAA PRTSSKIVSK SVARGGGRTS
YRGLVQINEG SHASRSTVKC DALLVDTVSR SDTYPYVDVR EDDASIGHEA SVSKVGEDQL
FYLMSRGLSE DEAMAMVVRG FIEPVARELP MEYALELNRL IELQMEGAVG
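
The gene and protein sequences above can be compared, and the GC content recomputed, with a short script. This is a minimal sketch using only the Python standard library, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; alternative start codons, which table 11 allows, are not handled, and that does not matter here because the gene starts with ATG. The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the
# standard genetic code (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record.
first_block = "ATGACCACCT CTGCCGAGAC CGCCCTTGAG"
print(translate(first_block))  # MTTSAETALE
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field listed in the record.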