Gene Franean1_4129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4129 
Symbol 
ID5672487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4912028 
End bp4913047 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content70% 
IMG OID641243005 
Productaminoglycoside phosphotransferase 
Protein accessionYP_001508422 
Protein GI158315914 
COG category[R] General function prediction only 
COG ID[COG3173] Predicted aminoglycoside phosphotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.753306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.346085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGT CCGACCGGAA GATGCACGCC AGCGAGGTCG TCATCGACAC CTCGCTGGTG 
GGCCGGTTGA TCGCCGCGCA GTTCCCCGAG TGGGCGGGCC TTCCCCTCGA ACCGGTCCGC
TCCGCAGGCG CTGACAACGC GATCTACCGC CTCGGCGTGG ACCTGGCAGT ACGGCTACCC
CGCATCCCCG CAGCGGCCGG GCAGGTGGAC AAGGAGCACC GGTGGCTGCC GCAGCTCGCG
CCCCTACTGC CATTGGACAT CCCGGTCCCG CTCGGGACGG GCACACCCGG CGAGGGCTAC
CCGTGGCCCT GGTCGGTCCA CCTGTGGCTG GAGGGCGAGG ACCTGCTCGC CGAACCTGTC
ATTGACCTAC ACCGCATGGC CATCGAACTG GGAAACTTCG TCGCAGCCCT GCAACAGGTC
GACCCCACGG GGGGACCACC TCCCGGGGCA CACAACTTCT TCCGTGGCGC CCCGCTCGCC
CGGCGGGACG CGGCGACCCG GGCCGCCATC CACTCCCTGC GGGCCACCCT CGACACCGCA
GCGGCGACCG CGGCGTGGGA CACGGCCATG CACGCGCCCC GCTGGCAGGG AACGCTGGTA
TGGATCCACG GCGACCTTCT CCCCGGGAAT CTGCTCACCC GGGGCGGCCG GCTGCATGCC
GTCATCGACT TCGGCGGCCT GGGCATGGGA GATCCGGCCT GCGACGTGAT GGCCGCCTGG
ACGCTGCTGT CCACCGAAAG CCGCGAGGCG TTCCGGAGCA CGATCGGAGC CGATGACGCG
ACCTGGGCAC GGGCCCGTGG CTGGGCGCTG TCCTTCGGAC TCATCGCCCT GCCCTACTAC
CAGGACAGCA ACCCCACACT CGCCCACATC GCCCGGCGCA CCATCGACGA GGCCATCACC
GATCCATCCC GGCCGCCGCC CACCGGCACC GCGCCGACGC CGGCGAAACC TGGACCCACT
ACGCCCCGGG CGAGCACAGT CGATGACCAG GAACTTCCTC GCTACTGTCA CATCACATGA
 
Protein sequence
MSESDRKMHA SEVVIDTSLV GRLIAAQFPE WAGLPLEPVR SAGADNAIYR LGVDLAVRLP 
RIPAAAGQVD KEHRWLPQLA PLLPLDIPVP LGTGTPGEGY PWPWSVHLWL EGEDLLAEPV
IDLHRMAIEL GNFVAALQQV DPTGGPPPGA HNFFRGAPLA RRDAATRAAI HSLRATLDTA
AATAAWDTAM HAPRWQGTLV WIHGDLLPGN LLTRGGRLHA VIDFGGLGMG DPACDVMAAW
TLLSTESREA FRSTIGADDA TWARARGWAL SFGLIALPYY QDSNPTLAHI ARRTIDEAIT
DPSRPPPTGT APTPAKPGPT TPRASTVDDQ ELPRYCHIT