Gene Franean1_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2958 
Symbol 
ID5671344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3483192 
End bp3484643 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content70% 
IMG OID641241864 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001507284 
Protein GI158314776 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGGTT CGGTCGAGGT CGTCGTGGAC GACGCCGAGC AGATCCTGGA TCGGGTGGCG 
GCGATCGACG TCGCGAAGGC GTCCGGGAAG GTGTGCGTGC GGGTCCCGCA CGACAGCCGG
GAAGGCAGGC GAGTCACCCG CGTCTTCGGT GTCACCGCGA CGGTCCCGGC GGTGGAGGAA
CTCGCCGACC ACCTGGTCTG CCAGGGCGTC CAGCGGGTCG TCGTCGAAAG CACGTCTGAC
TACTGGCGGG TGTTCTACTA CCTGCTCGAG GAGCGGGGCC TGACAGTGTG GCTGGTCAAC
GCCCGGGACG TCAGGAACGT CCCCGGAAGA CCCAAGACAG ACAAGATCGA CGCTGTGTGG
CTGGCGAAGT TGAACGAGCG GGGGATGCTG CGGCGGTCGT TCGTGCCGCC GGTGGCGATC
CGCCGGATGC GGGATGTGAC CCGGATGCGG GTCGACCTCG TCGCGGACCG CACGCGGGTC
AAACAGCGCG CAGAGAAACT ACTCGAAGGC GCGCTGATCA AACTGTCGTC GGTGGTCTCG
GACGTTTTCG GGGTCGCCGG CCGGCGGATC CTCAACGCAC TGATAGCCGG GGAACGCGAC
CCGCGCCGGC TCGCGGCACT CGGGACGGGT CTGAAGGCCT CGCCGGCAGC ACTGACCGCG
GCGCTGACCG GCCGGTTCAG CGACCACGAC GCGTTCATGC TCACGATCTA CCTGGAGCAG
ATCGACGCGC TCGACAAGCA CCTCGCCACG CTGTCCGCGC GGATCGACCA GATGACCGCG
GCGATCCCCC TTCCCACCCG TCGCACCGAC ACCCCCGGCG TCGCTGAGAT CACCACGGTG
CCCGGGGTCG GGACGGTCTC GCCGGTCACC GGTGAGATCA TCCCGCCCGC CGACGCCCCT
CCCGCAGGTG GCACGCCCCC ACCCGCCGGT GGCCTGCCGG GGCCGCAGAC GGTCGCGGAT
CTCGTCGACC TGCTCGACGC GATCCCCGGG ATCGGCAGGG ACGCCGCACA ATTGATCCTC
GCGGAGATCG GCACGGACAT GGCCCCGTTC GCAACCTCCG GGCACCTGGC GTCGTGGGCG
AAGCTGACCG CGCGCACGAT CCAGACCGGG GCGTCGCTAC GGATGGGCCG GACCGGGCGC
GGGAACCGGT ACGTGCGCCG CACACTCGGC ACGGCCGCGG CCTCCGTCGC ACGCACCAAC
ACCTTCCTCG GGGCACGTCA CCGGCGGTTA CGCGCCCGCC GCGGCGCGCT GAAGGCTCTC
GTCGCTACCA GCCGCACCAT CCTAGAGATC ATCTGGCGGA TGGTGCACGA CCAGGTGCCG
TTCCGGGAAC TCGGCGCGGA CTACCACACC CGCCACCAGG ACCCGGACAA GCGCAAGCGA
ACGCTGGCCC GGCAGATGAA AAACCTCGGG CTCTCGCCCG AGGAAGCCGC CGCCATGCTC
GCCGCAGCCT GA
 
Protein sequence
MDGSVEVVVD DAEQILDRVA AIDVAKASGK VCVRVPHDSR EGRRVTRVFG VTATVPAVEE 
LADHLVCQGV QRVVVESTSD YWRVFYYLLE ERGLTVWLVN ARDVRNVPGR PKTDKIDAVW
LAKLNERGML RRSFVPPVAI RRMRDVTRMR VDLVADRTRV KQRAEKLLEG ALIKLSSVVS
DVFGVAGRRI LNALIAGERD PRRLAALGTG LKASPAALTA ALTGRFSDHD AFMLTIYLEQ
IDALDKHLAT LSARIDQMTA AIPLPTRRTD TPGVAEITTV PGVGTVSPVT GEIIPPADAP
PAGGTPPPAG GLPGPQTVAD LVDLLDAIPG IGRDAAQLIL AEIGTDMAPF ATSGHLASWA
KLTARTIQTG ASLRMGRTGR GNRYVRRTLG TAAASVARTN TFLGARHRRL RARRGALKAL
VATSRTILEI IWRMVHDQVP FRELGADYHT RHQDPDKRKR TLARQMKNLG LSPEEAAAML
AAA