Gene Franean1_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4039 
Symbol 
ID5672397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4817470 
End bp4818603 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content66% 
IMG OID641242915 
ProductIS605 family transposase OrfB 
Protein accessionYP_001508332 
Protein GI158315824 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0972683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGAAGC GGGCGTACCG TTACCGCTTC TACCCGACCC CAGAGCAGGC CGAGCAGCTC 
GCCCGCGCCT TCGGCTGCGT CCGCTACGTC TACAACTGGG CACTCGCCGA GCGGTCTCGC
GCGTGGTTTC AGGAACGGTG TCGCATCACG CACGCCGAGA CCGACAAGAT GCTCACCGCG
TGGAAACGGG ACCCGGAGAC GGCATGGCTC GCCGAGCCGT CGAAAGGACC CTTGCAGGCC
ACGCTACGGC ACCTTCAGTC GGCGTTCGTG AACTTCTGGG AGAAGCGGGC CGGCTACCCG
TCTTTCAAGA AGAAGGGCAA GACCCTCGAG TCGGCGACCT ACTTCCGGAA CTGCTTCAGC
TACCGGAACG GCGCTGTCAC CCTCGCCAAG CAGGACCGGC CGCTGGACAT CGTCTGGTCG
CGTCCGCTGC CCGACGGCGC GGACCCGTCG CAGGTGACGG TGTCGCGGAA CGCCCGCGGC
CAGTACCACA TCTCGATCCT GGTCGAGGAG ACCATCACCA CCCTGCCTCC GACATCGGGG
CAGGTGGGGA TCGACGCGGG GATCACGAGT CTGGTCACTC TGTCGACCGG GGAGAAGGTC
ACCAACCCCC GGCACGAGCG TGCCGACCGG GCTCGGCTCG CACGCGCACA GTGGGAATTG
TCCCGCAAGG AGAAGGGCTC AGCGAACCGG GCGAAAGCCC GCGCGAGGGT CGCGAAGGTC
CACGGCCGTA TCCGGGACCG TCGTCGGGAT CATCTGCACA AGCTGTCCAC GAGGATCATC
CGCGAGAACC AAACGGTGGT CATCGAGGAC CTGTCCGTCC GCAACATGGT CCGCAGCCAT
TCGCTCGCAC GGGCGATCTC CGACGCATCG TGGTCGGAGC TGCGGACGAT GCTGGAGTAC
AAGGCCGGCT GGTACAGCCG CACCGTGATC GCGATCGACC GTTTCTACCC GAGCAGCAAG
ACCTGTTCGG TGTGCGGGTC GATCGTCGAG AAGATGCCGT TGAACGTCCG GGAATGGGCC
TGCCGCGGCT GCGGCACAGT CCACGACCGG GACGTGAATG CGGCGCGGAA CATTCTGGCC
GCGGGGCTCG CGGTGGCTGC CTGTGGAGAT GGAGTGAGAC CGCCTCGCTC CTGA
 
Protein sequence
MVKRAYRYRF YPTPEQAEQL ARAFGCVRYV YNWALAERSR AWFQERCRIT HAETDKMLTA 
WKRDPETAWL AEPSKGPLQA TLRHLQSAFV NFWEKRAGYP SFKKKGKTLE SATYFRNCFS
YRNGAVTLAK QDRPLDIVWS RPLPDGADPS QVTVSRNARG QYHISILVEE TITTLPPTSG
QVGIDAGITS LVTLSTGEKV TNPRHERADR ARLARAQWEL SRKEKGSANR AKARARVAKV
HGRIRDRRRD HLHKLSTRII RENQTVVIED LSVRNMVRSH SLARAISDAS WSELRTMLEY
KAGWYSRTVI AIDRFYPSSK TCSVCGSIVE KMPLNVREWA CRGCGTVHDR DVNAARNILA
AGLAVAACGD GVRPPRS