Gene Franean1_5850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5850 
Symbol 
ID5674173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7097412 
End bp7098611 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID641244700 
ProductIS605 family transposase OrfB 
Protein accessionYP_001510102 
Protein GI158317594 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.707323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCGGG CTTTCAGGTT CCTGCTCCGC CCGAGCGTGA AGCAGGCCGC CGCGCTGACG 
GCGATGCTCG ATGATCACCG GGCACTTTAC AACGCCGCGT TGCAGGAACG ACGTGACGCC
TACCGGCATC CGTCGAAGGC GACGGTCCGC TACGGCGACC AGTCCGCCCA GCTCAGGGAG
ATCCGCGCCT GCGACCCGGA TCAGGGCCGC TGGTCGTTCT CCTCCCAGCA GGCCACCCTG
CGTCGCCTCG ACAAGGCGTT CGCCGGCTTC TTCCGCCGCG TCAAAGCAGG CGAGACCCCT
GGCTACCCGC GGTTCAGAGG CGCGGGCCGG TTCGACACGG TCGAGTGGCC GAGGGACGGG
GACGGCTGCC GCTGGAACTC CCAGCCCGAG CATCCCACCC GGACCCGGGT CCGGCTTCAA
GGTGTCGGTC ACGTCAAGGT TCACCAGCAC CGGCCGGTGG CGGGCACGGT CAAGACGGTC
TCGGTGCGGC GGGAAGGCCG CCGCTGGTAT GTGGTCCTCT CCTGCGACGA CGTGCCCGCG
CGGCCGCTGC CGGCCACCGG GGTGGTGGTG GGGGTGGATA TGGGTGTGGC GTCGCTGGTG
ACCCTCTCCG ATGGCCGTCA GGTCGGTAAC CCGCGTTTTC TTGCCGCGGC GGCCGGTCGG
CTCGCGCGTG CGCAACGGGA ACTGGCCCGT AAGAAGCGGG GGTCGACCCG GCGCCGGAAG
GCCGTCGCGA AGGTCGCCGC GCTGCACGGC AGGGTTCGCC GGCAGCGCCT CGACCTCGCG
CACACGGTCG CCCGCTCCCT GGTCGCTGAC CATGATCTGA TCGCTGTGGA AGCGTTGCGG
ATTGTGAACA TGACTCGCCG GGGCTCGCCG AGACCCGATC CGGACCGGCC CGGAGTGTTC
GTGGCGAACG GGCAGGCGGC GAAGTCCGGG CTGAACAGGA GCGTTCTCGA CGCGGGATGG
GGGGTGTTCC TCGCTGTGCT GCGTGCCAAG GCTGAAAGTG CCGGACGGAC GGTCGTCGAG
GTCAACCCCG CCAACACCTC CCGCACCTGC GCGGTCTGCG GGCACTGCCA CGCCGACAAC
CGCAGAACAC AGGCCGCGTT CACCTGTGTC GCGTGCGGGC ATGCCGCGCA CGCCGATGTG
AACGCGGCGA TCAACATCCT TCGGGCCGGG CTGGCCCGTC AGGCCACCGA AGCGGCCTGA
 
Protein sequence
MRRAFRFLLR PSVKQAAALT AMLDDHRALY NAALQERRDA YRHPSKATVR YGDQSAQLRE 
IRACDPDQGR WSFSSQQATL RRLDKAFAGF FRRVKAGETP GYPRFRGAGR FDTVEWPRDG
DGCRWNSQPE HPTRTRVRLQ GVGHVKVHQH RPVAGTVKTV SVRREGRRWY VVLSCDDVPA
RPLPATGVVV GVDMGVASLV TLSDGRQVGN PRFLAAAAGR LARAQRELAR KKRGSTRRRK
AVAKVAALHG RVRRQRLDLA HTVARSLVAD HDLIAVEALR IVNMTRRGSP RPDPDRPGVF
VANGQAAKSG LNRSVLDAGW GVFLAVLRAK AESAGRTVVE VNPANTSRTC AVCGHCHADN
RRTQAAFTCV ACGHAAHADV NAAINILRAG LARQATEAA