Gene Franean1_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0473 
Symbol 
ID5668893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp557745 
End bp558773 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID641239403 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001504841 
Protein GI158312333 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCAT CGTATGAGGG TCGCCAGATT GTGGGAATCG ATCTCCATCG GCGGCGGTCG 
GTGATCGTGC GGATGACCCC GGACGGGTAT CCGCTGGGAA CGGTTCGGAT CAGCAACGAT
GCTGCCACCC TCGCTGCTGA GATCAGCCGT GCCGGTGAGC ACCCCGACGT GGTGTTGGAG
GCGACGTACG GCTGGTACTG GGCGGCGGAT GTGCTGGCCG CCGCTGGCGC GCAGGTTCAT
CTCGCGCACC CGCTGGGCGT GAAGGGTTTC GCCTACCGGC GGGTGAAGAA CGACGTCCGT
GACGCCGCGG ATCTCGCGGA CCTGCTGCGG ATGGGACGTC TGCCCGAGGC GTGGGTCGCT
CCACCGCCGG TGCGGGAACT GCGGGAGACC GTCCGTCACC GTGCGGCGCT CGTCGCGATC
CGTTCGGCGT GCAAGGCCCA GATCCACGCG GTCCTCGCGA AGAACGGTGT CGCGGTGCCC
ATGACCGACC TGTTCGGCCA GGCCGGAACC GACCTCCTCG GCCAGGTCGA GCTCCCGAGC
CCGTTCCACG CCCGCGTCAC GAGTCTGCGC CGCCTCATCG ACCTGCTCGA CTTCGAAATC
GACGCGGCCG CCTGTCAGCT CGCCGGCCGG CTCGCGCGGG ATCCGGGCTA TCAGGCGCTG
CTGGTCCTGC CCGGGGTCGG GAAGACCTTG GCCGCGGTGT TCCTCGCCGA GATCGGGGAC
ATCACCCGCT TCCCCACGCC CGGTCACCTT GCCAGTTGGG CTGGTCTCAC TCCCCGGCAC
CGTGAGTCCG ACACCACCGT GCACCGCGGC CACATCACCA AACAGGGCTC CTCCCTGATT
CGCTGGGCCG CGATCGAAGC CGTGTCGATC CTGCCTCCGA CGACCCCGGT CCTGGGCCCG
ACCAAGACCC GGGTCGCTGC CCGCCGCGGC ACCAACATCG GCAAGGTCGC CGCGGCCCGC
AAGCTGCTCA CGTTCGTCTT CTACGCGCTG CGCGACGGTG AGGTCCGCGC GCTGCACACG
GCGGCGTGA
 
Protein sequence
MSPSYEGRQI VGIDLHRRRS VIVRMTPDGY PLGTVRISND AATLAAEISR AGEHPDVVLE 
ATYGWYWAAD VLAAAGAQVH LAHPLGVKGF AYRRVKNDVR DAADLADLLR MGRLPEAWVA
PPPVRELRET VRHRAALVAI RSACKAQIHA VLAKNGVAVP MTDLFGQAGT DLLGQVELPS
PFHARVTSLR RLIDLLDFEI DAAACQLAGR LARDPGYQAL LVLPGVGKTL AAVFLAEIGD
ITRFPTPGHL ASWAGLTPRH RESDTTVHRG HITKQGSSLI RWAAIEAVSI LPPTTPVLGP
TKTRVAARRG TNIGKVAAAR KLLTFVFYAL RDGEVRALHT AA