Gene Franean1_5400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5400 
Symbol 
ID5673731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6515199 
End bp6516977 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content69% 
IMG OID641244255 
Productmethionyl aminopeptidase 
Protein accessionYP_001509661 
Protein GI158317153 
COG category[J] Translation, ribosomal structure and biogenesis
[V] Defense mechanisms 
COG ID[COG0024] Methionine aminopeptidase
[COG3570] Streptomycin 6-kinase 
TIGRFAM ID[TIGR00500] methionine aminopeptidase, type I 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGAT TCGAGCTGCC GCGCAACCTC TCATGTGCCG GGGCGGGGCG CCGCCGATCG 
CCCGGTCGTG CTGAAGGTCG GCTGGCGCCG CAGGGAAGCG AACACGAGAC CGATGGTCTG
CGGACCTGGG CCGGGCGGGG AGCGGTGTTC CTCCTCGACG CCCTAGCTAG CCGACGGGGC
GACCAGCGCG CTGCCGCTCG AACGCTGGGA GCCGGGCACG ACTCTCGCCG GGGCACGAGC
GGAACACGAG CAGGACGCGG TCGTCGCCGG ACTGCTGCGC CGACTGTGGA TCACCCCGTC
AGACGGGCTC CGTTCCGCCC GCTGCAAGAC ATGTGCGACG CCTGGGCAGC CGAGTTCGCC
GAACGACTCG ACGCCGCGCC CGGCGCGATC GACCCAGGGC TGGCCCGCGC CGCCATCGAG
TTGTTCCACA CGCTGCCGGG CAGCGTCGAG CGGGAAGTGC TGCTGTGCAC TGACCCGCAC
GCCGGGAACA TCCTGGCGGC CCGGCGCGAG CCGTGGCTGG TCATCGACCC GAAACCCTAC
GTCGGCGATC CCGCCTACGA CCCGGTCCAG CACATGCTCA ACCGAGACGA GGGCCTCGAC
CTCGACCCCG ACCGGGTCAA CCGGTGGCTG TTCGCCCGCT GCGCCCAGCA ATCGATCGAC
GTTGCAGGTC GCCTCGGCCC GGGAGCACGG CTGGACATGG GAACGGATCG CCGCCGCGAT
GGGGAGCACC CGGGCGGTAC ACAAGAAGTA CGTAGCGAGC AGGCGGATCG GACGGAGGCA
GCCATGAGCC GGCGTAAGGC CCACGGCTGC GACGGGCCCA CGGCCCACCT GTCTCGGCCG
ATCAGCCCGG TGCTGACGGC GGCCCGGGAG GAGGCCGAGC AGGCCCGCCA CGGCTATGTC
GGGCCCGAGA CTCACGAGGC CAACCGCTCG CGTGCGTTGC GACCTGATCT TGTGGGTTCG
ACCTCGCCGT CAGGCTGGTT GATCTCGGCC CACACTTCAT CGAGGGAGAG CCCGAGGACA
TCGGCGATCG CCGCGATGGT CGGGAAGGCA GGGGTGGCTA CGCGACCAGA CTCGATCTTC
CGAAGGGTTT CTGGTGAGAC ACCTGCATCT AGCGCGGTTC GGTGGACCAA GACCATGATC
GTCGAGGCTG GGGCGCAGTC CTGCTACGTC GACTATGAGC CGTCCTTCGG ACGCGGGCCG
TTCGGCCACT ACATCTGCAC GGCCGTCAAC GACGCCGTGC TCCACGGACT GCCCTACGAC
TACACGCTTG CCGACGGCGA CCTGCTGACG CTCGACCTCG CCGTCTCCAG AGACGGAGTC
GCTGCAGACT CCGCCATCAG CTTCATCGTG GGCGACTCAA AGCCCCCGGA GAGCGTCGCG
ATGATCAGCG CAACCGAACG CGCATTGAGC GCAGGGATAG CCGCTGCCGG CCCCGGAGCT
CGCATCGGCG ACATCTCCCA TGCCATCGGC TCCGTCCTCA GCGAGGCAGG GTACCCGATC
AACACCGAGT TCGGAGGTCA TGGCATCGGA TCAACGATGC ACCAGGACCC GCACGTTTCA
AACACCGGAC GGCCCGGCCG TGGATACAGA CTGCGCCCTG GGCTGCTGCT CGCGCTGGAG
CCGTGGGTCA TGGCGGACAC CGCCGAGCTC GTCACCGATG CCGACGGCTG GACCCTCCGA
AGCGCGACAG GCTGCCGGAC AGCGCACAGT GAGCACACGA TCGCCATCAT CAACAACGGA
GCCGAAATCC TCACCTTGCC GACGCAGGCG CACTCGTGA
 
Protein sequence
MSGFELPRNL SCAGAGRRRS PGRAEGRLAP QGSEHETDGL RTWAGRGAVF LLDALASRRG 
DQRAAARTLG AGHDSRRGTS GTRAGRGRRR TAAPTVDHPV RRAPFRPLQD MCDAWAAEFA
ERLDAAPGAI DPGLARAAIE LFHTLPGSVE REVLLCTDPH AGNILAARRE PWLVIDPKPY
VGDPAYDPVQ HMLNRDEGLD LDPDRVNRWL FARCAQQSID VAGRLGPGAR LDMGTDRRRD
GEHPGGTQEV RSEQADRTEA AMSRRKAHGC DGPTAHLSRP ISPVLTAARE EAEQARHGYV
GPETHEANRS RALRPDLVGS TSPSGWLISA HTSSRESPRT SAIAAMVGKA GVATRPDSIF
RRVSGETPAS SAVRWTKTMI VEAGAQSCYV DYEPSFGRGP FGHYICTAVN DAVLHGLPYD
YTLADGDLLT LDLAVSRDGV AADSAISFIV GDSKPPESVA MISATERALS AGIAAAGPGA
RIGDISHAIG SVLSEAGYPI NTEFGGHGIG STMHQDPHVS NTGRPGRGYR LRPGLLLALE
PWVMADTAEL VTDADGWTLR SATGCRTAHS EHTIAIINNG AEILTLPTQA HS