Gene Franean1_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2645 
Symbol 
ID5671038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3127470 
End bp3129038 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content70% 
IMG OID641241560 
Producttransposase IS4 family protein 
Protein accessionYP_001506980 
Protein GI158314472 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTCCGT ACGTGCGGAC GGTGAAGACG GCGTCGGGTG CGCGGGCGGT CCAGATCGTG 
CACTCGCAGC ACAAAGGTTC GCGGGAGATC GAGCACGTCG GGTCGGCGCA CATCGACGCC
GACCTGGAGC TGCTCAAGGC GGTGGCGCGG CAGCGGCTGG CCGCGGGGCA GGGCGAGCTC
GACCTGCGGC TGCCCGGGAG CCCTGCGAAC TCCGGCGCCG CCTTGCCGAT CACGTCGACG
CGGGCGGGCC ACCTGCTCGA CGCGCTGGCC CGAGGCTACG AGGCGCTGGG GTTCGGTGCC
GTGACCGGGC GGGACGAGGT GTTCCAGGCG TTGGTGCTCG CACGGGTAGT CGAGCCGACC
AGCAAGCTCG ACTCGCTGCG CGTGCTGGAA GAAGTCGGTG CCCCGGCGCC GTCCTACCGG
ACCGTCCAGC GCCGGCTGCG TCGCTACGCC GGCGTCGACG AGGTCGACGC CGAGACCGGG
CAGCCCGTTC CCGTCGACCC GGCGGGCGGG CCGTGGCGGG CACGGCTGTC GCGGGCCTGC
GCGGACCACG TCAAGCTCGG CCCGGCGATT CTGTTGCTGT ACGACGTGAC GACGCTGTAC
TTCGAGACCG ACCAGGGCGA CGGGTTCCGC GAGCCGGGGT TCAGCAATGA ACGGCGCCTG
GAACCGCAGG TCACCGTCGG CCTGCTCACC GACGGGGCGG GGTTCCCGCT GACGGTGCAC
GCCTTCGAGG GCAACCGCGC GGATACCACC ACGATGCTGC CCGTCCTGAC CGCCTTCCTG
AAAGCGCACG ACCTGCGCGA CGTGACAGTC GTGGCGGACG CCGGCATGGT CTCGGAGGCG
AACAAACGCG CGATCGAAGC AGCCGGGCTG TCGTTCGTCC TCGGCGCCCG CGTCCCCGAG
GTGCCCTACC TGGTCAAGGC GTGGCGCGAA CGGCACCCGG ACACCGAAAT CCCCGACGGG
CACGTGTTCG TCCAGCCCTG GCCCGCCGGC CCGTCCGACA ACCGGCGCGA CCACACGGTC
TTCTACCAGT ACAAGGCCGA CCGAGCCCGT CGCACGCTGC GCGGCATCGA CCAGCAGGTC
GCCAAGGCCG AGAACGCCGT CGCAGGCAAG ACCGCGGTGA AACGGAACCG GTACGTGCGC
CTGACCGGTG CGAAGAAGTC GGTCAACCGG GCGCTCGAAG AGAAGAACCG CGCCCTGGCC
GGGATCAAGG GCTACGTCAC CAACTTGCCG AACCCCGACC CCGACCAGGT CATCAGCACC
TACAGCCAGC TGCTGAACGT CGAGAAGAGC TTCCGGATGA GCAAGTCCGA CCTCGCGGCC
CGGCCGATCT ACCACCACAC CCGAGAGTCG ATCGAGGCGC ACCTGACAGT CGTGTTCGCC
GCCCTCGCCG TCAGTCGCTG GATCGAGAAC ACCACCGGCT GGTCCGTCCG CAAGTTCGTC
AAGACCGCGC GCCGCTACCG CACCGTCACG ATCCAGGCCG GAGAGCACAC CATCACCGCA
GCCGATCCCG TTCCCGACGA CCTACGAGCC GCCCTCGACG CCGTCCACGG CACGCACCAA
TTGGCCTAA
 
Protein sequence
MSPYVRTVKT ASGARAVQIV HSQHKGSREI EHVGSAHIDA DLELLKAVAR QRLAAGQGEL 
DLRLPGSPAN SGAALPITST RAGHLLDALA RGYEALGFGA VTGRDEVFQA LVLARVVEPT
SKLDSLRVLE EVGAPAPSYR TVQRRLRRYA GVDEVDAETG QPVPVDPAGG PWRARLSRAC
ADHVKLGPAI LLLYDVTTLY FETDQGDGFR EPGFSNERRL EPQVTVGLLT DGAGFPLTVH
AFEGNRADTT TMLPVLTAFL KAHDLRDVTV VADAGMVSEA NKRAIEAAGL SFVLGARVPE
VPYLVKAWRE RHPDTEIPDG HVFVQPWPAG PSDNRRDHTV FYQYKADRAR RTLRGIDQQV
AKAENAVAGK TAVKRNRYVR LTGAKKSVNR ALEEKNRALA GIKGYVTNLP NPDPDQVIST
YSQLLNVEKS FRMSKSDLAA RPIYHHTRES IEAHLTVVFA ALAVSRWIEN TTGWSVRKFV
KTARRYRTVT IQAGEHTITA ADPVPDDLRA ALDAVHGTHQ LA