Gene Franean1_1461 details

Gene Information

Locus tag: Franean1_1461
Symbol:
ID: 5669865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1757384
End bp: 1759003
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 69%
IMG OID: 641240381
Product: transposase IS4 family protein
Protein accession: YP_001505807
Protein GI: 158313299
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
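
The listed gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions agree with the values in this record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1757384, 1759003)
print(length, protein_length(length))  # 1620 539
```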


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.646816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGACGG CCTCGGGTGC GACGGCGGTG CAGATCGCCG AGTACGTCGG TGGCCGTCGT 
CAGCGGATCG TGGCGCATGT GGGGTCCGCC CATACCGAAG CGGAGCTCGG GATCCTGCTG
GCGCGGGCCG AGGAGATGCT CGCCGACAGC CAGCAGGCGG CGCTCGACCT CGGGATCGAG
CCGGCCGTGC GCACGGCCAG GCTGCTCGGG TCGCCGCGTG AGCCGGCGCT GTTCGACCCC
GAGCCCGCCG CCGGACCTGC CGCGGTGGTC GGGCCCGCGA AGGTGCTCAC GACCGCGTCG
GTGCTGCTGT TCGACGCCCT GGCCTCGGTG TTCACCGATC TCGGGTTCGA CGCGTTGGGC
GACCCGGTGT TCCGGGACCT GGTGATCGCC CGGGTCGTGG AGCCGACGTC GCTGCTCGAC
ACCGGCCGGG TGCTCACCGA CTTGGGACGC AAGCCCGCGG CGTACGCGAC GATGAAACGC
ACCCTGACCC GCTGCGCCTC CGGCGGCTAC CGCGACCAGG TCGCCGACCT GTGCTTTGCC
CACGCCCTGG CCCACGGCGA CGTCTCCCTA TGCCTCTATG ACGTGACCAC GCTGTACTTC
GAGGCCGAGA AGGAAGACGA CCTACGCAAG GTCGGCTACT CCAAGGAACG TCGCGTCGAC
CCGCAGATCG TCGTCGGGCT GCTTGTCGAC CGTTACGGCT TCCCGCTGGA GATCGGCTGC
TTCGAGGGCA ACCGGGCCGA GACCGCGACG ATCCTGCCGA TCATCCGCCA GTTCAAAGAC
CGTCACCAGC TCGAGAACCT GGTCGTGGTC GCCGACGCCG GCATGCTGTC CGCGACCAAC
CTGCGTGAAC TCGACGACGC CGGGTTCGGG TTCATCGTCG GCTCACGGGT CACCAAGGCG
CCGATCGACC TGGCCTCCCA CTTCCGCTGG CACGGCGACG CCTTCACCGA CGGCCAGGTC
ATCGACACGG TCACGCCCCG CACAGGGCGC AACCGTGACA ACGACACCGA CGTGAAGACC
GAGCCTGTCT GGACCCGTGA CCAGCACCCC AGGTCGTGGC GAGCCGTCTG GGCCTACTCG
GCCAAGCGCG CCGCCCGCGA CAACAAGACG CTGACCGCGC AGGAGAACCG CGCCCGCGCT
GTCGTCGACG GCGAGAAGAC TACCCGCACG CCGAGATTCG TCACCGTCAA GGGCGACGCC
GCCACCCTCG ACGAGGCCAG CCTCACCCGG GCCCGCCGGC TCGTCGGGCT GAAGGGCTAC
GTCACCAACC TGCCGGTCAC CGTCCTGACC GCCGACCAGG TCATCTCGAA CTACCACGAC
CTTTGGCACG TCGAGCAGTC GTTCCGGATG TCGAAGACCG ATCTCGCTGC CCGGCCCATG
TTCGTCCGCA CGAAGGAGGC GATCGACGCC CACCTGACGA TCGTGTTCAC CGCGCTCGCC
GTCGCCCGCA CCGTTCAGAA CCGCACCGGC CTCGCGGTCC GCAACGTGAT CCGACAGCTC
CGCCCGCTGC GCTCCGCGAC CATCGCGATC AACGGCGCCA TCCAGACCTT CCCGCCCGCG
ATCAACCCGG ACAAACAAGC GGTACTCGAC ACCCTCCACG CGGCGGCCGT CACGCACTAA
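
A GC content figure like the one in this record can be recomputed from the gene sequence above. This is a minimal sketch that assumes GC content is the fraction of G and C bases over all bases, with the blank-separated 10-base blocks joined first; `gc_content` is a hypothetical helper, not part of any tool that produced this record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 6 G + 2 C out of 10 bases.
print(gc_content("GTGCGGACGG"))  # 0.8
```

Passing the full gene sequence, pasted as one string, gives the value to compare with the rounded percentage in the Gene Information block.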
 
Protein sequence
MRTASGATAV QIAEYVGGRR QRIVAHVGSA HTEAELGILL ARAEEMLADS QQAALDLGIE 
PAVRTARLLG SPREPALFDP EPAAGPAAVV GPAKVLTTAS VLLFDALASV FTDLGFDALG
DPVFRDLVIA RVVEPTSLLD TGRVLTDLGR KPAAYATMKR TLTRCASGGY RDQVADLCFA
HALAHGDVSL CLYDVTTLYF EAEKEDDLRK VGYSKERRVD PQIVVGLLVD RYGFPLEIGC
FEGNRAETAT ILPIIRQFKD RHQLENLVVV ADAGMLSATN LRELDDAGFG FIVGSRVTKA
PIDLASHFRW HGDAFTDGQV IDTVTPRTGR NRDNDTDVKT EPVWTRDQHP RSWRAVWAYS
AKRAARDNKT LTAQENRARA VVDGEKTTRT PRFVTVKGDA ATLDEASLTR ARRLVGLKGY
VTNLPVTVLT ADQVISNYHD LWHVEQSFRM SKTDLAARPM FVRTKEAIDA HLTIVFTALA
VARTVQNRTG LAVRNVIRQL RPLRSATIAI NGAIQTFPPA INPDKQAVLD TLHAAAVTH
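
The protein sequence begins with M even though the gene begins with GTG, because an alternative initiation codon is read as methionine. The sketch below illustrates this with a small pure-Python translator. It assumes the codon-to-amino-acid mapping of translation table 11 matches the standard code and that the start codon set is the one listed in the comment; `translate` and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGCGGACGG CCTCGGGTGC GACGGCGGTG"))  # MRTASGATAV
```

Translating the complete gene sequence this way should give a string that can be compared block by block with the protein sequence above.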