Gene Franean1_0298 details

Gene Information

Locus tag: Franean1_0298
Symbol:
ID: 5668722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 350427
End bp: 352688
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 68%
IMG OID: 641239228
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_001504670
Protein GI: 158312162
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
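
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon; neither assumption is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is unspliced and, by default, that its length
    includes one terminal stop codon.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length(350427, 352688)
print(length)                  # 2262
print(protein_length(length))  # 753
```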


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.370393
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCAA GAAGAATCTT CCGCGGCTGG GTACCTCTGC TGCTCCTGGT CCTCTTCGTG 
ATCATCCTCA CGACGGGCGT GCTGTCCGGA CCGAGTGAGT ATCAGAAAGC AGACCTGAGC
TTCGTACAGC AGCAGATCGA CCAGAGCGCC GGAGCAAAAC CCGAGACCAG GGTCGTCGAT
GCGACGATCC AGGACTCCAA GCAGATCATC CGGATCGAGC TGGGGAACGG CAAGAAGTAC
GAGTCGTCCT TCGCCACCGA GCAGGCCCTG GTCCTCGCCA ACGAGCTCAA GCAGCAGAAC
ATCAAGTACA ACGTCTCCGT CGACCGCGGG AACGTCCTCG TCTCGCTGCT ACTCAACCTG
CTCCCCGTGC TGTTGATCGT GTTCCTGCTG CTGTTCTTCA TGAACCAGAT GCAGGGCGGC
GGCAACCGTG TCATGAACTT CGGCAAGTCC AAGGCGAAAC TGGTCAGCAA GGACACACCC
AAGACCACGT TCGCTGATGT CGCCGGCGCG GACGAGGCGA TCGAGGAGCT CCAGGAGATC
AAGGAGTTCC TCGAGAACCC GGGCAAGTTC CAGGCGATCG GCGCCAAGAT CCCCAAGGGC
GTCCTGCTCT ACGGGCCGCC CGGCACCGGC AAGACGCTGC TCGCCCGCGC CGTCGCAGGT
GAGGCCGGCG TCCCGTTCTA CTCGATCTCC GGCTCCGACT TCGTCGAGAT GTTCGTCGGC
GTCGGTGCGA GCCGGGTCCG CGACCTGTTC GAGCAGGCCA AGGCGAACGC GCCCGCGATC
ATCTTCGTGG ACGAGATCGA CGCCGTCGGC CGGCACCGCG GGGCGGGCCT GGGCGGTGGC
CACGACGAGC GCGAGCAGAC CCTCAACCAG CTCCTGGTCG AGATGGACGG GTTCGACGTG
AAGGGCGGCG TCATCCTGAT CGCCGCGACC AACCGACCCG ACATCCTCGA CCCGGCCCTG
CTGCGTCCCG GCCGGTTCGA CCGCCAGATC GTGGTCGACC GCCCCGACCT CCTCGGTCGC
GAGGCGATCC TGAAGGTCCA CGCCAAGGGC AAGCCGATCA GCTCGGACGT CGACATGCTG
ATCATCGCCC GGCGCACCCC CGGCTTCACC GGCGCGGACC TGGCGAACGT GCTCAACGAG
GCGGCGCTGC TCGCCGCGAG GTCGGACGTC CGGTTCATCT CGTCGGCGCT GCTCGAGGAG
TCGATCGACC GGGTCATGGC CGGGCCGGAG CGCAAGACCC GCGCGATGAG CGACCGGGAG
AAGAAGCGGA TCGCCTACCA CGAGGGCGGT CATGCCCTGG TGGCGCACGC GCTCCCCAAC
GCCGACCCGG TCCACAAGAT CACGATCCTG CCGCGTGGCC GGGCGCTCGG GTACACGATG
CAGCTCCCCC TGGAGGACAA GTACCTGTCG ACCAGGTCGG AGATGCTCGA CCGGCTCGCC
GTCCTCCTCG GCGGGCGCAC CGCGGAGGAG CTCGTCTTCC ACGAGCCGAC CACCGGGGCC
AGCGACGACA TCGAGAAGGC GACCCAGATC GCCCGCGCGA TGATCACCCA GTACGGCATG
AGCGACAAGC TCGGCGCGAT CAAGTTCGGC AGCGAGTCCG GTGAGGTCTT CCTCGGCCGC
GACATGGGTC ACCAACGCGA CTACTCCGAG GAGGTCGCGA GCGAGATCGA CGACGAGGTG
CGCCGGCTGA TCGAGGCCGC GCACGACGAG GCCTGGGAGA TCCTGGTCAC CTACCGGGAC
GTCCTCGACA ACCTCGTCCT GCGGTTGATG GACACCGAGA CCCTGAGCAA GGACGACGTG
CTCGAGGTCT TCGCCACCGT CCAGAAGCGC CCCAGCCGTG GTTCGTACAC CGGCGTGGGG
CGGCGCATCC CGTCGGACCG ACCGCCGGTG CAGACCCCGG CCGAGCTGGG TCTCGTCCCG
TCCAAGGTCT CTGACCTGGT GAAGGGCAAC AGTGGGCCGG CCGGCCATGC CACGAACGGC
GGCGGCACGA ACGGCGGTGG TTCGGCGAAC GGCGGTGGGG TGAGCGGTGG CAACGGCACC
GCGGGCCACC CCCGCCCGGC AGGCCAGGCC GACCCGGGCA CCGCCGGAGG TCCCGCGGGC
CTCGGGCACG GCGACCCCGC GCAGAGCGGG CCTGGACACA GCGCCTCGAG CCCGGCCCAC
GGCACACCCC CGCCGGGATC CCCGCCGCCG GAAGGCCCCC GGATCGCGAA CCCCTGGGCA
CCGCCCACCT GGGACAACGA TGACGACAGG AGACGCCGTT GA
 
Protein sequence
MTPRRIFRGW VPLLLLVLFV IILTTGVLSG PSEYQKADLS FVQQQIDQSA GAKPETRVVD 
ATIQDSKQII RIELGNGKKY ESSFATEQAL VLANELKQQN IKYNVSVDRG NVLVSLLLNL
LPVLLIVFLL LFFMNQMQGG GNRVMNFGKS KAKLVSKDTP KTTFADVAGA DEAIEELQEI
KEFLENPGKF QAIGAKIPKG VLLYGPPGTG KTLLARAVAG EAGVPFYSIS GSDFVEMFVG
VGASRVRDLF EQAKANAPAI IFVDEIDAVG RHRGAGLGGG HDEREQTLNQ LLVEMDGFDV
KGGVILIAAT NRPDILDPAL LRPGRFDRQI VVDRPDLLGR EAILKVHAKG KPISSDVDML
IIARRTPGFT GADLANVLNE AALLAARSDV RFISSALLEE SIDRVMAGPE RKTRAMSDRE
KKRIAYHEGG HALVAHALPN ADPVHKITIL PRGRALGYTM QLPLEDKYLS TRSEMLDRLA
VLLGGRTAEE LVFHEPTTGA SDDIEKATQI ARAMITQYGM SDKLGAIKFG SESGEVFLGR
DMGHQRDYSE EVASEIDDEV RRLIEAAHDE AWEILVTYRD VLDNLVLRLM DTETLSKDDV
LEVFATVQKR PSRGSYTGVG RRIPSDRPPV QTPAELGLVP SKVSDLVKGN SGPAGHATNG
GGTNGGGSAN GGGVSGGNGT AGHPRPAGQA DPGTAGGPAG LGHGDPAQSG PGHSASSPAH
GTPPPGSPPP EGPRIANPWA PPTWDNDDDR RRR
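
The gene and protein sequences above can be cross-checked with a small Python sketch. It strips the 10-character block spacing, computes a GC fraction, and translates codons using the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which start codons it permits). The helper names are hypothetical, and only short fragments copied from the listing are used here, not the full sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Note: this does not special-case alternative start codons
    (e.g. GTG, TTG), which is fine for a gene that starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listing.
print(translate("ATGACTCCAA GAAGAATCTT CCGCGGCTGG"))  # MTPRRIFRGW
print(translate("AGACGCCGTT GA"))                     # RRR
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison against the GC content and protein sequence listed in the record.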