Gene Franean1_1885 details

Gene Information

Locus tag: Franean1_1885
Symbol:
ID: 5670287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2262972
End bp: 2264015
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 73%
IMG OID: 641240807
Product: aminoglycoside phosphotransferase
Protein accession: YP_001506229
Protein GI: 158313721
COG category: [R] General function prediction only
COG ID: [COG3173] Predicted aminoglycoside phosphotransferase
TIGRFAM ID:
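
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2262972
end_bp = 2264015
gene_length_bp = 1044
protein_length_aa = 347

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values (1044 bp, 348 codons, 347 aa).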


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.969674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGCC CGACGAGAAC CGCCGTCGAG GGGCTGGACC TGCCGGCGCT CGACCGGTTC 
TTCGCCGAGC GGGTCCCCGG TTTCCGCGGC GAGCTGACGG CCGAGCGGCT GTCGGGCGGC
CGCTCGAACC TCACGTACCT GCTCACCGAC GGGACGACCC GCTGGGTGCT GCGGCGGCCG
CCGCTGGGTG GGCTCACCCC GTCGGCCCAC GACGTGCTCC GCGAGCACCG CGTGGTGTCG
GCCCTGTCCG ACAGTGTGGT CCCCGTCCCC CGCGCGGTCG CCCACAGCGA GGGCGACCCG
CTGGGCGTGC CGTTCTCCGT GGTCGAGTAC GTTCCCGGCC CGGTCATCCG CACCGAGGAG
GAGCTGCACG CGCTCCCCCA GGCCGACATC GACCGCTGCG CCCATGCCCT GATCGACGTG
CTGGCGCGGT TGCACTCCGT CGAAGCGGAC GGGGTCGGTC TGGGCGCCTT CGGTCGCCCG
CAGGGCTATC TCGGCCGCCA GGTACGGCGA TGGAACGACC AGTGGCAGCG GATCCGCACC
CGCGCCCTGC CCGATGTCGA CGCCCTGTAC GCCAGGCTCG CCGAGGCACA CCCGGTGGAG
AGCGGCGCGT CGATCGTGCA CGGCGACTTC CGGATCGACA ACGTCATCGT GGCGCCGGAG
GACCCGGCGA CGGTACGCGC GGTGGTGGAC TGGGAGATGG CGACCCTGGG CGACCCGATC
GCCGACCTGG GCGTGCACAT CGCCTATTCC GATCCGGCGT TCGCGCCGGT GCTGGGCGGC
TCGGCGGCGT CCACCAGCCC GCGCCTGCCG GCCGCGAGCG AGCTGACCGG CCGCTACGCC
GAGGTCACCG GCCGGGACCT GAGCAATTTC CCGTTCTACC TTGCGCTCGG CTATTTCAAG
GTGGCGGTGA TCGCGGAGGG CATCCACTAC CGGTTCAGGC AGGGCGTGAC CCGCGGCGCC
GGGTTCGAGT CCGTCGGCGA GGCGACGGCG CCACTCGCCG CGGCCGGGCT ACGCGCCCTG
AACGGAAACC TGGAGACACC TTGA
 
Protein sequence
MRRPTRTAVE GLDLPALDRF FAERVPGFRG ELTAERLSGG RSNLTYLLTD GTTRWVLRRP 
PLGGLTPSAH DVLREHRVVS ALSDSVVPVP RAVAHSEGDP LGVPFSVVEY VPGPVIRTEE
ELHALPQADI DRCAHALIDV LARLHSVEAD GVGLGAFGRP QGYLGRQVRR WNDQWQRIRT
RALPDVDALY ARLAEAHPVE SGASIVHGDF RIDNVIVAPE DPATVRAVVD WEMATLGDPI
ADLGVHIAYS DPAFAPVLGG SAASTSPRLP AASELTGRYA EVTGRDLSNF PFYLALGYFK
VAVIAEGIHY RFRQGVTRGA GFESVGEATA PLAAAGLRAL NGNLETP
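
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how they might be checked against each other follows; it is an illustration, not the method used to produce this record. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ only in permitted start codons).

```python
# Hypothetical helpers for checking the sequence blocks in this record.

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order (same for tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(raw: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGAGGCGCC CGACGAGAAC CGCCGTCGAG")
print(translate(first_blocks))  # MRRPTRTAVE
```

Running `translate` and `gc_content` on the full cleaned gene sequence gives values that can be compared with the listed protein sequence and GC content.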