Gene Franean1_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0116 
Symbol 
ID5668541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp137538 
End bp138677 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content71% 
IMG OID641239044 
Productaminoglycoside phosphotransferase 
Protein accessionYP_001504489 
Protein GI158311981 
COG category[R] General function prediction only 
COG ID[COG3173] Predicted aminoglycoside phosphotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGACG AGGACGCGTT CGACGTACCC GCGGTGGACG GCTGGCTTCG GCGCCGGCTG 
CAGGAGCGGG CCGCGGTCGG CGCGGCCGAG ATGGGGGTCG CGACCGGCGC GGGGGTGCTG
ACCGAGCGCC CGCCGGTCGC CCGTTCCATC CCGCCGCCGG GCCGCCCGCA GGTCCGGCAG
TTCTCCGGCG GCGCGTCCAA TCTGACCTAT CTCCTGCGCT ACGAGGACCG CGACCTCGTC
CTGCGGCGGC CGCCGCACGG GCGGAAGGCG TCCGGCGCCC ACGACATGGC GCGGGAGTAT
CGGGTGCAGG CCCGGCTGCG CCCGGCGTTC CGGTACGTCC CGCGGATGGT CGCCTTCTGC
GACGACCCCA CGGTGATCGG CTCGGAGTTC TACATCATGG AGCGGGTGCC CGGCGTCATC
CCGCGCTCGG AGTTCCCGCG CTCACTCACC TTCGATCCGG AGCGGACGCG CCGGCTGGCC
TTCCAGGTCG TGGATCTCCT GGTCGCCCTG CACGACATTG ACCCGGTCGC CTACGGGCTG
TCCGACCTGG GGCGCGGTGC CGGGTATGTC GATCGGCAGC TCACCGGCTG GGCCCGGCGC
TACCGGGACG CCCGCACGCC GAACGTGCCC TCCTTCGAGG TCGTCATGCG CTGGCTCACC
GAGTACGCCC CCGAGGACGT CGCCACCTGC GTCATCCACA ACGACTTTCG CATCGACAAC
GTGGTCTTCG ATGTGGCCCG CATCGGTGAC GACGGCCTGC CCAGGATCAG TGGTGTCCTC
GACTGGGAGA TGGCCACCCT CGGCGATCCG CTCATGGACC TCGGGGGTGC CCTCGCCTAC
TGGGTGCAGG CTGACGACGA CGCCCTCTTC CGGCTCACCC GGCGCCAGCC GACCCACAGC
CCCGGCATGC CCACCCGGGC GGAGATCGTC GAGTACTACG CGGCACGGCG CGGACTGGAC
GTCGGCAGAT GGCCCTTCTA CCAGGTGTTC GGGCTGTTCC GGCTGGCTGT CATCGCGCAG
CAGATCTACT TCCGCTACCA CCACGGCCAG ACGACCAACC CGGCGTTCCG CGAGTACTGG
CAGGTCGTCA CACACCTCGA GAAGCGGTGC CTGCGCGTGA TGGCGGCGGC CGGCCTCTAG
 
Protein sequence
MRDEDAFDVP AVDGWLRRRL QERAAVGAAE MGVATGAGVL TERPPVARSI PPPGRPQVRQ 
FSGGASNLTY LLRYEDRDLV LRRPPHGRKA SGAHDMAREY RVQARLRPAF RYVPRMVAFC
DDPTVIGSEF YIMERVPGVI PRSEFPRSLT FDPERTRRLA FQVVDLLVAL HDIDPVAYGL
SDLGRGAGYV DRQLTGWARR YRDARTPNVP SFEVVMRWLT EYAPEDVATC VIHNDFRIDN
VVFDVARIGD DGLPRISGVL DWEMATLGDP LMDLGGALAY WVQADDDALF RLTRRQPTHS
PGMPTRAEIV EYYAARRGLD VGRWPFYQVF GLFRLAVIAQ QIYFRYHHGQ TTNPAFREYW
QVVTHLEKRC LRVMAAAGL