Gene Franean1_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1048 
Symbol 
ID5669462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1228328 
End bp1229323 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content78% 
IMG OID641239977 
Productthioredoxin domain-containing protein 
Protein accessionYP_001505410 
Protein GI158312902 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01068] thioredoxin 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.157412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.991017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGGA GGTCACCTGT CCCACCCGGC TCGCCGCGCA GGGGCGGGCC GGGCAGTATG 
CCCGACACCC TCGGCGGCCT ACGGCTCGCC GGTGCGGTTC CCCTAGACCC GAAGCCGGCC
CAGCCGCCGG CGCCCCCGGC CGGCCGGGCC GGCCCCGGTG GCTCTCCCGG CGGCGCGGCG
CCCGCCGGTG CCGCCGGCCC GGTCGTGATC GACGTGACCG AGGCGACGTT CGCCGAGGAC
GTCGTGAACC GTTCCATGCA GGTTCCCGTG GTCATCGACT TCTGGGCCGA GTGGTGCGGG
CCGTGCAAGC AGCTCAGCCC CATCCTGGAG CGCCTCGCCG CCGCCGACGG CGGTCGCTGG
GTGCTCGCCA AGGTGGACGT CGACGCCAAC CCCGGTCTCG CGCAGGCCGC GGGCGTGCAG
GGCATCCCCG CGGTCAAGGC GGTGGTCGGC GGCCGGATCA TCGGAGAGTT CACCGGCGCG
GTCCCCGAGC GGGAGGTGCG CGGCTGGCTG GACCAGCTCC TGAGCGTCGT CGGGGAGGCG
ATGGGCGGGC TGCCGGGGGC GGGAGCCGAG GGCGGTCCCG CGCTGCCGCC GAACATCGCC
GCCGCGGAGG ACGCGATGGC CACCGGCGAC CTGGACGCGG CGGCCGCCGC CTACCAGGCC
CAGCTCGCCG AGGCCCCCGG GGACGCGGAC GCCACCCTCG GCCTGGCCCG GGTGGAGCTG
CTGCGGCGGG TGCGCGGCTA CGACCCGGCC TGGCTGCGTC AGCGGCTCTC GGAGAACCCC
GACGACATCG AGGCGGCGCT CGCGGTGGCC GACCTGACCA TCGCCCAGGG CGACCCGGCC
ACCGGCCTGG GCCGCCTCGT CGACCTCGTC CGCCGCACCT CGGGCGACGA CCGGGAGAAG
CTGCGGGCGC ATCTGGTCGG GCTGTTCCAG GCGCTGGGCG ACGGCGAGCC GGCGGTCGCC
CCGGCCCGGC GGGCCCTCGC CGCCGCCCTG TTCTGA
 
Protein sequence
MQRRSPVPPG SPRRGGPGSM PDTLGGLRLA GAVPLDPKPA QPPAPPAGRA GPGGSPGGAA 
PAGAAGPVVI DVTEATFAED VVNRSMQVPV VIDFWAEWCG PCKQLSPILE RLAAADGGRW
VLAKVDVDAN PGLAQAAGVQ GIPAVKAVVG GRIIGEFTGA VPEREVRGWL DQLLSVVGEA
MGGLPGAGAE GGPALPPNIA AAEDAMATGD LDAAAAAYQA QLAEAPGDAD ATLGLARVEL
LRRVRGYDPA WLRQRLSENP DDIEAALAVA DLTIAQGDPA TGLGRLVDLV RRTSGDDREK
LRAHLVGLFQ ALGDGEPAVA PARRALAAAL F