Gene Franean1_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1984 
Symbol 
ID5670385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2383614 
End bp2385593 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content70% 
IMG OID641240905 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001506327 
Protein GI158313819 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.614445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCTCCC GTGTGAGCAT CGGCCATATT CATGCCTCCG CGCCGGCGAA ATCCGGGCCG 
CCGAAGGACA CTCCCCCGAA ACCCGCGCCG CCGCCGATGC CGGGCTGGCG CAGGTTCCTC
ATCCCGGTCG GGCTGCTCGT CACGGTGATG CTGCTGATCG CTCCGAGCCT GTTCGCCACC
CAGCCGGATT CGCTGACCTA CTCCGATTTC GTGGGTCGGG TCGACAGCGG CGTGGTGCGC
TCGGTCACCA TCGATGACCG CGGTGGTGTC GACGGCACGC TGACCGACGG CACTGACTTC
ACCACCCAGA TCCCCACCGC ACTCGACACA ACCGCGCTCG AACGTCAGCT CGCCGCCAAG
AAGGTGCAGA TCACGGCCAC CAGGACGGGC ACGTCGTTCT GGTCGGTGGT GCTGAGCTTC
CTGCCGCTGC TGCTCCTGAT CGGTTTCTTC GTGTGGTCCG GCCGCATGGC CCGGCGCCAG
CTCTCCGGCG GCGGCGCGCT CGGAATGTTC GGCCGCTCCC GGGCGAAGAT CACCGAGGCG
GACCGGCCTG ACACTCGTTT CAGCGACGTC GCGGGCTACG AGGGCGCGAA ACAGGAGATC
AGCGAGGTCG TCGCCTTCCT GCGCAATCCC GACCAGTATC TTGAAGTCGG CGCACACGGC
CCGCGCGGCG TGCTGATGGT CGGCCCGCCC GGCACCGGCA AGACCCTGCT CGCGCGCGCC
GTCGCCGGCG AGGCCGAGGT GCCGTTCCTT TCGATCACCG GCTCGGGCTT CGTCGAGATG
TTCGTCGGTG TCGGCGCATC CCGGGTGCGG GACCTCTTCA CCGAGGCTCG CAAGCGGGCG
CCGTCCATCA TCTTCATCGA CGAGATCGAC GCCATCGGCG GCCGGCGCGG CTCCAGCGCG
TTCGGCGGGT CCAACGACGA GCGCGAGCAG ACGCTGAACC AGCTCCTCGC CGAGATGGAC
GGGTTCGAGT CCACGTCGGG CGTGGTCGTC CTCGCCGCCA CGAACCGCCC CGAGACCCTT
GACCACGCGC TGCTGCGCCC GGGTCGGTTC GACCGCCAGG TCACAGTGCC GCTGCCGACC
CAGTCCGAAC GGGCCGAGAT CCTCGCCGTG CACACCCGCG GCAAGGCACT CACCGATGAC
GCCGACCTCA CCCGGATCGC CCGGGGCACG CCCGGATTCT CCGGCGCGGA CCTGGCCAGC
CTCGTCAACG AGGCGGCGAT CAACGCCGTC CGCGACGGAC GTTCGGTGGT CAGCGCCGCC
GATCTCGACG CCGCGCGTGA CCGCATTCTC CTCGGACGGC GCGATGCCTC GAACGCCCTG
CTCCCGGACG AGAAGCGGTC CGTCGCCGTG CACGAGTCGG GCCATGCCCT GGTGGCGGCA
CTCTGCGACG ACGCCGACCC GGTCGCGAAG GTGACCATCC TCCCCTCGGG CATGGCGCTC
GGCGTCACCC AGCAGCTCCC CGAGGCCGAG CGGCACCTCT ACTCCGAGGC TTATCTGCTG
GACAGCCTGG CCGTGCGGCT CGGCGGTCGG GCGGCCGAGC TGGTGGTGTT CGGCCACGGC
TCCACCGGTG CCTCGAACGA CCTGGCCGGC GCGACCCAAC TCGCCACCCG GATGGTGCGT
GAGTTCGGGC TGTCGGAGGA GATCGGACCG GTGGGCTACT CGTCCGACGG GCCCAACTTC
CTCGGCGGGG ACGACCTCAT GGCCCGCCCC TACTCGGAGC AGACGCAACG AGTGATCGAC
GCCGAGGTGG CACGGCTGCT GCGGGAGGCT CAGGCGCGAG CCGTCGACCT GCTGCGCATG
CATCGGAACG CACTCGACGC CCTGACCGCG CGCCTGCTGG AACGGGAGAC CGTCGACGGC
ACGGTGGTAG AGGAGCTCGC CGCCGCGTCG ATGGCGAGCT TCACGCGCAG CCCGAACGGA
GACGGCTCGG GGGAAGGTCC GGACGGCGGC ATCCCACCGC AGATCTCCCT GCAGACCTGA
 
Protein sequence
MISRVSIGHI HASAPAKSGP PKDTPPKPAP PPMPGWRRFL IPVGLLVTVM LLIAPSLFAT 
QPDSLTYSDF VGRVDSGVVR SVTIDDRGGV DGTLTDGTDF TTQIPTALDT TALERQLAAK
KVQITATRTG TSFWSVVLSF LPLLLLIGFF VWSGRMARRQ LSGGGALGMF GRSRAKITEA
DRPDTRFSDV AGYEGAKQEI SEVVAFLRNP DQYLEVGAHG PRGVLMVGPP GTGKTLLARA
VAGEAEVPFL SITGSGFVEM FVGVGASRVR DLFTEARKRA PSIIFIDEID AIGGRRGSSA
FGGSNDEREQ TLNQLLAEMD GFESTSGVVV LAATNRPETL DHALLRPGRF DRQVTVPLPT
QSERAEILAV HTRGKALTDD ADLTRIARGT PGFSGADLAS LVNEAAINAV RDGRSVVSAA
DLDAARDRIL LGRRDASNAL LPDEKRSVAV HESGHALVAA LCDDADPVAK VTILPSGMAL
GVTQQLPEAE RHLYSEAYLL DSLAVRLGGR AAELVVFGHG STGASNDLAG ATQLATRMVR
EFGLSEEIGP VGYSSDGPNF LGGDDLMARP YSEQTQRVID AEVARLLREA QARAVDLLRM
HRNALDALTA RLLERETVDG TVVEELAAAS MASFTRSPNG DGSGEGPDGG IPPQISLQT