Gene Franean1_1190 details

Gene Information

Locus tag: Franean1_1190
Symbol:
ID: 5669603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1415793
End bp: 1417226
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 73%
IMG OID: 641240122
Product: peptidase M16 domain-containing protein
Protein accession: YP_001505550
Protein GI: 158313042
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
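
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 1415793
end_bp = 1417226

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1434 477
```

Under these assumptions the computed values match the listed Gene Length (1434 bp) and Protein Length (477 aa).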


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0547363
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.134748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGAGA CGGATCTGTC CACCTCGGAC GACCGCACCT CGGACGGTCG CACCTCGAAC 
GACCGCACCC CGGGCCTGGC CGGGCCGGAG CACGCCTCCG GCCCGCGCCC GGCCCGGGTG
GCCGAGCTTC TCGCGGCGGG TCCCGGTTCC GAGGTCCTGC TCGCCGGGGC CGTGCGCCGC
ACGGTGCTGC CGGGCGGCCT GCGGGTCGTC ACGGAGAAGG TCCCCGGGGT CCGGTCGGTG
GCGATCGGGA TCTGGGTGGG CGTCGGCTCG CGGGACGAGA CGCCGCTCAC CGGTGGCTGC
TCGCACTACC TGGAGCACCT GCTGTTCAAG GGCACCCCGA GCCGGGACGC CCTGTCGATC
AGCGCCTCCA TCGAGGCCGT CGGCGGTGAT CTCAACGCCT TCACCGCCAA GGAGTACACC
TGCTACTACG CGCGGGTGCT CGACGTGGAC ATGGACCTGG CCATCGACGT CGTCTGTGAC
ATGGTCGCCA ACTCGCTGGT GACCGCGGAC GACGTCGAGG CCGAGCGGGG CGTGATCCTC
GAGGAGATCG CCATGCACGA GGACGACCCC GGCGACGTCG TGCACGACGT CTTCGCCGAC
GCCGTCCTCG GCTCCTCCGT CCTGGGGCGA CCGGTGCTCG GCACGGTCGA CACCATCGAG
GCGCTCGGCC GGGAGACGGT CTTCGACTAC TACCGCGAGC GGTACGCCCC GCCCGCGCTG
GTCGTCTCGA TCGCCGGCAA CATCGAGCAC GACCACGCCC TGGACCGGGT GGTGGCCGCG
TTCGCCGACC GGCTCACCGG GCCCGCCCGG CACCAGGAGG TGCGCCGCGG CGAGTACCCG
TTCCCGCCGC CGCCGGGCAT CGTCGTCACC AACCGGCCGA CCGAGCAGGC CCACGTGGTG
CTCGGCACGG CCGGCCTGTC CCGGCACGAC CCGCGCCGGT ACGCGCTCGG CGTGCTGTCG
ACGGCCCTCG GTGGCGGGAT GAGCTCGCGG CTGTTCCAGG AGGTGCGGGA GAAGCGCGGG
CTGGCCTACT CCGTGTACTC CTTCGACAAC CAGTTCGCCG ACGCCGGGCT GTTCGGCGTC
TACGCGGGCT GTACCCCGGG GCGCGCCGAC GAGGTGCTGG AGATCTGCCG CGAGCAGGTG
CACCGGATCG CCGAGCACGG CATCACCGCG GAGGAGCTCG AGCGGGCCCG CGGCCAGAAC
CGCGGCGGCC TGGTGCTCAA CCTGGAGGAC ACCGGGTCGC GGATGAGCCG GCTCGGCAAG
AGCGAGCTCG TCCACGGCGA GCTGCTCTCG GTCGACGAGG TGCTCGCCCG GGTCGAGGCC
GTCACACTCG ACGACGTGCG GGCCGTCGCC GGCGAGCTGG TCGACCAGCC GTGGGCGCTC
GGCGTCATCG GCCCGTTCGA GGACCACGAC TTCAGCGCGG CCGTAGCGCG GTGA
 
Protein sequence
MTETDLSTSD DRTSDGRTSN DRTPGLAGPE HASGPRPARV AELLAAGPGS EVLLAGAVRR 
TVLPGGLRVV TEKVPGVRSV AIGIWVGVGS RDETPLTGGC SHYLEHLLFK GTPSRDALSI
SASIEAVGGD LNAFTAKEYT CYYARVLDVD MDLAIDVVCD MVANSLVTAD DVEAERGVIL
EEIAMHEDDP GDVVHDVFAD AVLGSSVLGR PVLGTVDTIE ALGRETVFDY YRERYAPPAL
VVSIAGNIEH DHALDRVVAA FADRLTGPAR HQEVRRGEYP FPPPPGIVVT NRPTEQAHVV
LGTAGLSRHD PRRYALGVLS TALGGGMSSR LFQEVREKRG LAYSVYSFDN QFADAGLFGV
YAGCTPGRAD EVLEICREQV HRIAEHGITA EELERARGQN RGGLVLNLED TGSRMSRLGK
SELVHGELLS VDEVLARVEA VTLDDVRAVA GELVDQPWAL GVIGPFEDHD FSAAVAR
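
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 (the NCBI bacterial, archaeal and plant plastid code) assigns amino acids to codons exactly as the standard code does, but allows alternative start codons such as GTG, which are read as methionine at the first position. The function name `translate` and the constants below are hypothetical helpers written for this example; they are not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in the record can be
    pasted directly. Ambiguous bases such as N are not handled in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is translated as methionine regardless of
        # the amino acid it encodes elsewhere.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGACGGAGA CGGATCTGTC CACCTCGGAC GACCGCACCT CGGACGGTCG CACCTCGAAC"
print(translate(first_row))  # MTETDLSTSDDRTSDGRTSN
```

The result corresponds to the first 20 residues of the protein sequence (MTETDLSTSD DRTSDGRTSN), with the initial GTG read as M.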