Gene Franean1_2165 details


Gene Information

Locus tag: Franean1_2165
Symbol:
ID: 5670565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2599255
End bp: 2600388
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 67%
IMG OID: 641241086
Product: IS605 family transposase OrfB
Protein accession: YP_001506507
Protein GI: 158313999
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
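
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2599255, 2600388)
print(length)                  # 1134
print(protein_length(length))  # 377
```

Both results match the listed Gene Length (1134 bp) and Protein Length (377 aa).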


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.896467
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAAGC GGGCGTACCG CTACCGCTTC AACCCGACCC CCGATCAGGC CGCCCAGCTC 
GCGCGAACCT TCGGCTGCGT CCGCTACGTG TACAACCGGG CGCTCGCCGA ACGGCACCGG
GCCTGGTTCC AGGAGCAGCG GCGGGTCACC CACGCCGAGA CCGACCGGAT GCTCACGGCG
TGGAAACGCG ACCCGGAAAC GGAATGGCTC GCCGAGCCGT CGAAAGGCCC GCTTCAGGCC
ACGCTGCGGA ATCTCCAGAC CGCGTATGTG AACTTCTGGC AGAAACGCGC CGGCTACCCG
ACGTTCAAGA AGAAGGGCAG GACCCTCGAC TCGGCGACCT ACTTCCGGAA CTGTTTCAGT
TTTCGGGACG GTCGGATCAC GCTGGCGAAG CAGGACGCGC CGCTGGCGAT CGTCTGGTCG
CGTCCGCTGC CCGAGGGCGC GGAGCCCTCG CAGGTCACGG TGTCGCGGAA CGCCCGCGGC
CAGTACCACG TCTCGATCCT GGTCGAAGAG AAGATCACTA CGCTTCCCGC GTTGCCCGGG
CGGGTGGGGA TCGACGCGGG GGTCACCTCG CTGGTCACCC TGTCGACGGG GGAGAAGGTG
GCCAACCCGA AGCACGAGCG TCGGGATCGG GCCCGGCTGG CCTGTGCGCA GCGGGACCTG
TCCCGGAAGG TGCAGGGGTC GGTGAACCGG GCGAAGGCCC GAGCGAGGGT CGCCCGGGTG
CACGGGCGGA TCGCCGACCG GCGTCGGGAT CATCTCCACC AGCTGTCCAC GAGGATCATC
CGCGAGAACC AAACGGTGGT CATCGAGGAT CTGTCCGTCC GCAACATGGT CAGGAACCAT
TCGCTCGCGC GGGCGATCTC CGATGCTTCG TGGTCGGAGT TGCGGCGGAT GTTGGAGTAC
AAGGCCGGCT GGTACGGTCG CACCATCATT GCGATCGATC GGTTCTATCC GTCGTCCAAA
ACCTGTTCGG TGTGCGGGTC GATCGTGAAG GAACTGCCGC TCAACGTCCG GGAACGGGCC
TGCCGTGGTT GCGGCACGGT CCACGACCGG GACGTGAACG CGGCGGTCAA CATTCTGGCC
GCGGGGCTCG CGGTGGCTGC CTGTGGAGAT GGAGTGAGAC CGCCTCGCTC CTGA
 
Protein sequence
MVKRAYRYRF NPTPDQAAQL ARTFGCVRYV YNRALAERHR AWFQEQRRVT HAETDRMLTA 
WKRDPETEWL AEPSKGPLQA TLRNLQTAYV NFWQKRAGYP TFKKKGRTLD SATYFRNCFS
FRDGRITLAK QDAPLAIVWS RPLPEGAEPS QVTVSRNARG QYHVSILVEE KITTLPALPG
RVGIDAGVTS LVTLSTGEKV ANPKHERRDR ARLACAQRDL SRKVQGSVNR AKARARVARV
HGRIADRRRD HLHQLSTRII RENQTVVIED LSVRNMVRNH SLARAISDAS WSELRRMLEY
KAGWYGRTII AIDRFYPSSK TCSVCGSIVK ELPLNVRERA CRGCGTVHDR DVNAAVNILA
AGLAVAACGD GVRPPRS
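
The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC percentage in Python; `clean_sequence` and `gc_percent` are hypothetical helper names chosen for illustration, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed above.
first_row = "GTGGTGAAGC GGGCGTACCG CTACCGCTTC AACCCGACCC CCGATCAGGC CGCCCAGCTC"
row = clean_sequence(first_row)
print(len(row))  # 60
print(row[:3])   # GTG
print(round(gc_percent(row)))  # GC percentage of this row only
```

Passing the full gene sequence through the same two helpers gives values that can be compared with the listed Gene Length and GC content. The leading GTG is an alternative start codon under translation table 11, which is why the protein sequence begins with M.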