Gene Franean1_1554 details

Gene Information

Locus tag: Franean1_1554
Symbol:
ID: 5669957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1860955
End bp: 1862088
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 67%
IMG OID: 641240473
Product: IS605 family transposase OrfB
Protein accession: YP_001505899
Protein GI: 158313391
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
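
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; the variable names are illustrative, and it assumes that the start and end positions are inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1860955
end_bp = 1862088
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("whole number of codons:", span_bp % 3 == 0)
print("expected protein length (aa):", expected_protein_aa)
print("matches record:", span_bp == gene_length_bp and expected_protein_aa == protein_length_aa)
```

Under these assumptions the span works out to 1134 bp and 377 residues, which agrees with the Gene Length and Protein Length fields.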


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.596284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAAGC GGGCGTACCG CTACCGCTTC AACCCGACCC CCGATCAGGC CGCCCAGCTC 
GCGCGAACCT TCGGCTGTGT CCGCTACGTG TACAACCGGG CGCTCGCCGA ACGGCACCGG
GCCTGGTTCC AGGAGCAGCG GCGGGTCACC CACGCCGAGA CCGACCGGAT GCTCACGGCG
TGGAAGCGCG ACCCGGAAAC GGAATGGCTC GCCGAGCCGT CGAAAGGCCC GCTTCAGGCC
ACGCTGCGGA ATCTCCAGAC CTCGTATGTG AACTTCTGGC AGAAACGCGC CGGCTACCCG
ACGTTCAAGA AGAAGGGCAG GACTCTCGAC TCGGCGACCT ACTTCCGGAA CTGTTTCAGT
TTTCGGGACG GTCGGATCAC GCTGGCGAAG CAGGACGGGC CGCTGGCGAT CGTCTGGTCG
CGTCCGCTGC CCGAGGGCGC GGAGCCCTCG CAGGTCACGG TGTCGCGGAA CGCCCGCGGC
CAGTACCACA TCTCGATCCT GGTCGAAGAG ACGATCACTA CGCTTCCCGC GTTGCCCGGG
CGGGTGGGGA TCGACGCGGG GGTCGCCTCG CTGGTCACCC TGTCGACGGG GGAGAAGGTG
GCCAACCCGA AGCACGAGCG TCGGGACCGG GCCCGGCTGG CCCGTGCGCA GCGGGACCTG
TCCCGGAAGG TGCAGGGGTC GGCGAACCGG GCGAAGGCCC GAGCGAGGGT CGCCCGGGTG
CACGGTCGGA TCGCCGACCG GCGTCGGGAT CATCTCCACG CGCTGTCCAC GAGGATCATC
CGCGAGAACC AAACGGTGGT CATCGAGGAT CTGTCCGTCC GCAACATGGT CAGGAACCAT
TCGCTCGCGC GGGCGATATC CGATGCTTCG TGGTCGGAGT TGCGGCGGAT GTTGGAGTAC
AAGGCCGGCT GGTACGGTCG CACCCTCATT GCGATCGATC GGTTCTATCC GTCGTCCAAA
ACCTGTTCGG TGTGCGGGTC GATCGTGAAG GAACTGCCGC TCAACGTCCG GGAATGGGCC
TGCCGTGGTT GCGGCACGGT CCACGACCGG GACGTGAACG CGGCGGTCAA CATTCTGGCC
GCGGGGCTCG CGGTGGCTGC CTGTGGAGAT GGAGTGAGAC CGCCTCGCTC CTGA
 
Protein sequence
MVKRAYRYRF NPTPDQAAQL ARTFGCVRYV YNRALAERHR AWFQEQRRVT HAETDRMLTA 
WKRDPETEWL AEPSKGPLQA TLRNLQTSYV NFWQKRAGYP TFKKKGRTLD SATYFRNCFS
FRDGRITLAK QDGPLAIVWS RPLPEGAEPS QVTVSRNARG QYHISILVEE TITTLPALPG
RVGIDAGVAS LVTLSTGEKV ANPKHERRDR ARLARAQRDL SRKVQGSANR AKARARVARV
HGRIADRRRD HLHALSTRII RENQTVVIED LSVRNMVRNH SLARAISDAS WSELRRMLEY
KAGWYGRTLI AIDRFYPSSK TCSVCGSIVK ELPLNVREWA CRGCGTVHDR DVNAAVNILA
AGLAVAACGD GVRPPRS