Gene Franean1_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0685 
Symbol 
ID5669102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp803665 
End bp804993 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID641239612 
ProductIS605 family transposase OrfB 
Protein accessionYP_001505050 
Protein GI158312542 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.289129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACGT TGCAGGCGTA CCGGTTCGCG CTCGACCCGA ACGACGCCCA GGCCGCGAAC 
CTGCGTCGCC ACGCGGGGGC GGCCCGGTTC GCCTACAACT GGGGCCTGGC CCGGGTGAAG
GCCGCGTTCG CGCAGCGCGA CGCGGAGAAG TCCTACGGCC TGGACGGCGA CCTGCTCACC
CCGGTGCCGT GGACGCTGCC CGCGCTGCGG CTTGCCTGGA ACGCCGCCAA GCGGGACGTC
GCGCCCTGGT GGGACGAGTG CTCGAAGGAG GCGTACTCGG CCGGGCTGGA CCAGTTGGCC
CGTGCGTTGA AGAACTTCAC CGACTCCCGG AAGGGAAAGC GCAACGGCCG CCGGGTCGGT
TTTCCCCGGT TCAAGAAGCG CGGGAAGGCC CGCGACTCGT TCCGCTACAC GACCGGTGCC
TACGGCCCCG CGACCGATCT GTACGTGAAA CTGCCCCGGA TCGGCCGGGT CAAGGTCGGC
GAGCCGATGG GCGCGCTCAC GTCGCGGCTG GCGGATGGCC GGGCGCGGCT GGGTGGCGCG
ACGGTGTCCC GGACGGCTGG CCGCTGGTTC GTGGCGTTCA CCGTCGACAC CGACCGGGAC
GTTTCCGAAC GGCCGACCCG CCGTCAGTGG ACGGGCGGCA CGGTCGGCGT CGACCTGGGC
GTGAAACACC TCGCGGTCCT CTCCACCGGC GAGACGGTGG CGAACCCGAA ACGGCACGCC
GCCGCGCTGC GGAAACTGCG CCGCGCGTCG CGGGCCTATG CCCGGTCGAA GCCGGGTAGC
GCTGGGCGCC GGCAGCGCGC CGCCGGGCTC GCGACGATCC ATGCCCGGGT CGCGAACCAG
CGCCGCGACG GGCTGCACAA GCTCACGACA CGGCTCGCCC GGTCCCACGA CGTGATCGTG
GTCGAGGATC TACACGTCGC CGGGATGGTC CGCAACCGGC GGCTCGCCCG CGCCGTCTCG
GACGTCGGGA TGGGTGAGAT CCGCCGGCAA CTCGACTACA AGACCCGCTG GTACGGTTCG
CGGCTGCACG TCGCGGACCG CTGGTATCCG TCCTCGAAGA CCTGTTCCGG CTGCGGCTGG
CGAAACCCAA GCCTGACGCT GTCGGACCGC ACGTTCCGCT GCCAGTCCTG CGGGCTGGTG
GCCGACCGCG ACCACAACGC CGCGATCAAC CTCAGACACC AGGTCGCCGC CAGTACGTCG
GAGACCGTAA ACGCCCGTGG AGCCGACCAT AAGACCCGCA CGAGCGGGCA GGTGGCTGGG
AAGCGGGAAC CTGGCACGGC CAAGGCCGGT CAGACCAGGA GTGCCGGCGC GCAAGTGCCG
GCGGCGTGA
 
Protein sequence
MRTLQAYRFA LDPNDAQAAN LRRHAGAARF AYNWGLARVK AAFAQRDAEK SYGLDGDLLT 
PVPWTLPALR LAWNAAKRDV APWWDECSKE AYSAGLDQLA RALKNFTDSR KGKRNGRRVG
FPRFKKRGKA RDSFRYTTGA YGPATDLYVK LPRIGRVKVG EPMGALTSRL ADGRARLGGA
TVSRTAGRWF VAFTVDTDRD VSERPTRRQW TGGTVGVDLG VKHLAVLSTG ETVANPKRHA
AALRKLRRAS RAYARSKPGS AGRRQRAAGL ATIHARVANQ RRDGLHKLTT RLARSHDVIV
VEDLHVAGMV RNRRLARAVS DVGMGEIRRQ LDYKTRWYGS RLHVADRWYP SSKTCSGCGW
RNPSLTLSDR TFRCQSCGLV ADRDHNAAIN LRHQVAASTS ETVNARGADH KTRTSGQVAG
KREPGTAKAG QTRSAGAQVP AA