Gene Franean1_4623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4623 
Symbol 
ID5672968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5513040 
End bp5514068 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID641243484 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001508900 
Protein GI158316392 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCAT CGTATGAGGG TCGCCAGATT GTGGGAATCG ATCTCCATCG GCGGCGGTCG 
GTGATCGTGC GGATGACCCC GGACGGGTAT CCGCTGGGAA CGGTTCGGAT CAGCAACGAT
GCTGCCACCC TCGCTGCTGA GATCAGCCGT GCCGGTGAGC ACCCCGATGT GGTGCTCGAA
GCGACCTACG GCTGGTACTG GGCGGCGGAT GTGCTGGCCG CCGCTGGCGC GCAGGTTCAT
CTCGCGCACC CGCTGGGCGT GAAGGGTTTC GCCTACCGGC GGGTGAAGAA CGACGTCCGT
GACGCCGCGG ATCTCGCGGA CCTGCTGCGT ATGGGACGTC TGCCCGAGGC GTGGGTCGCT
CCACCGCCGG TCCGGGAACT GCGGGAGACC GTCCGTCACC GGGCGGCGCT CGTCGCGATC
CGGTCGGCGT GCAAGGCCCA GATCCACGCG GTCCTCGCCA AGAACGGTGT CGCGGTGCCC
ATGACCGACC TGTTCGGCCA GGCCGGAACC GACCTCCTCG GCCAGGTCCA GCTCCCGAGC
CCGTTCCACG CCCGCGTCAC GAGTCTGCGC CGCCTCATCG ACCTGCTCGA CTTCGAAATC
GACGCGGCCG CCTGTCAGCT CGCCGGCCGG CTCGCGCGGG ATCCGGGCTA TCAGGCGCTG
CTGGTCCTGC CCGGGGTCGG GAAGACCTTG GCCGCGGTGT TCCTCGCCGA GATCGGGGAC
ATCACCCGCT TCCCCACGCC CGGTCACCTT GCCAGCTGGG CTGGTCTCAC CCCCCGGCAC
CGTGAGTCCG ACACCACCGT GCACCGCGGC CACATCACCA AACAGGGCTC CTCCCTGATT
CGCTGGGCCG CGATCGAAGC CGTGTCGATC CTGCCTCCGA CGACCCCGGT CCTGGGCCCG
ACCAAGACCC GGGTCGCTGC CCGCCGCGGC ACCAACATCG GCAAGGTCGC CGCGGCCCGC
AAGCTGCTCA CGTTCGTCTT CTACGCGCTG CGCGACGGTG AGGTCCGCGC GCTGCACACG
GCGGCGTGA
 
Protein sequence
MSPSYEGRQI VGIDLHRRRS VIVRMTPDGY PLGTVRISND AATLAAEISR AGEHPDVVLE 
ATYGWYWAAD VLAAAGAQVH LAHPLGVKGF AYRRVKNDVR DAADLADLLR MGRLPEAWVA
PPPVRELRET VRHRAALVAI RSACKAQIHA VLAKNGVAVP MTDLFGQAGT DLLGQVQLPS
PFHARVTSLR RLIDLLDFEI DAAACQLAGR LARDPGYQAL LVLPGVGKTL AAVFLAEIGD
ITRFPTPGHL ASWAGLTPRH RESDTTVHRG HITKQGSSLI RWAAIEAVSI LPPTTPVLGP
TKTRVAARRG TNIGKVAAAR KLLTFVFYAL RDGEVRALHT AA