Gene Francci3_4314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4314 
Symbol 
ID3907283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5154370 
End bp5156631 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content68% 
IMG OID637881642 
ProductMername-AA223 peptidase 
Protein accessionYP_483389 
Protein GI86742989 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.492417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCAA GAAGAATCTT CCGCGGCTGG GTGCCACTGC TGCTCCTGGT CCTGTTCGTG 
ATCATTCTCA CGACGGGCGT CCTCTCGGGT CCCAGTGAGT ACGGCAAGCG AGACCTGAAC
TTCGTCCAGC AGCAGATCGA CGAAGGCCAG GTGGCGAAGG CCAAGATCCA GGATTCGAAG
CAGCTCATCC AGATCCAGAC GAAGGACGGC CAGAAGTTCG AGTCGTCCTA TGTCACCGAA
CAGGGTGTCG TCCTGGCGAA CGAGCTCCGC AACAAGCGGG TCGCCTACGA CGTCTCGGTC
GACCGTGGAA ACATCCTGGT CTCGCTCCTG CTGAACCTGC TGCCCGTACT GCTGGTCGTC
CTTCTGCTGT TCTTTTTCAT GAACCAGATG CAGGGCGGCG GTAACCGGGT CATGAACTTC
GGCAAGTCCA AGGCGAAGCT GGTGAGCAAG GACACGCCGA AGACGACGTT CGCCGACGTG
GCCGGTGCGG ACGAGGCCAT CGAGGAGCTC GAGGAGATCA AGGAGTTCCT CGAAAACCCG
GGCAAGTTCC AGGCGATCGG AGCCAAGATT CCCAAGGGGG TCCTGCTCTA CGGCCCGCCG
GGAACCGGCA AGACGCTGCT GGCCCGGGCC GTCGCGGGTG AGGCCGGGGT GCCCTTCTAC
TCCATCTCGG GTTCCGACTT CGTCGAGATG TTCGTCGGTG TCGGCGCCAG CCGGGTTCGC
GACCTGTTCG AACAGGCCAA GGCCAACGCG CCCGCCATCA TCTTCGTTGA CGAGATCGAC
GCCGTGGGCC GCCACCGCGG AGCGGGACTC GGCGGTGGCC ACGACGAGCG GGAGCAGACG
CTCAACCAGC TCCTGGTCGA GATGGACGGG TTCGACGTCA AGGGCGGGGT CATCCTCATC
GCCGCGACGA ACCGGCCCGA CATCCTCGAC CCGGCCCTGC TGCGGCCCGG CCGCTTCGAC
CGGCAGATCG TCGTGGATCG GCCGGACCTG CTCGGCCGCG AGGCCATCCT GCGGGTGCAC
GCCAAGGGCA AGCCGATCGG CCCGGACGCC GACATGATGG TCATCGCCCG GCGGACCCCC
GGGTTCACCG GTGCCGATCT GGCCAACGTA CTGAACGAGG CAGCCCTGCT GGCCGCCCGC
TCCAACCTGA AATTCATCTC GTCGGCGCTG CTCGAGGAGT CCATCGACCG CGTGATGGCG
GGACCGGAGC GCAAGACCCG CGCGATGAGC GACAAGGAGA AGAAGCGCAT CGCTTACCAC
GAGGGCGGCC ACGCCCTCGT GGCGCACGCG CTGCCCAACT CCGACCCGGT GCACAAGGTG
ACCATCCTGC CCCGGGGGCG CGCGCTCGGG TACACGATGC AGCTTCCCCT GGAGGACAAG
TACCTCTCGA CCCGCTCGGA GATGCTCGAC CGCCTCGCCG TCCTGCTCGG CGGGCGCACC
GCCGAGGAGC TGGTGTTCCA CGACCCGACC ACGGGGGCGA GCGACGACAT CGAGAAGGCC
ACCCAGATCT CGCGAGCAAT GATCACCCAG TACGGGATGA GCGACAAGCT CGGCGCGATC
AAGTTCGGCA CCGAGAACAG CGAGGTCTTC CTCGGCAAGG AGGTCGGCCA CCAACGCGAC
TACTCCGAAG AGGTCGCCAG TGAGATCGAC ATCGAGGTGC GCCGGCTGAT CGAGGCCGCA
CACGACGAGG CCTGGGAGAT CCTGGTGACC TACCGGGACG TACTGGACAA CCTCGTGTTG
CGGCTGATGG ACACCGAGAC GCTGAGCAAG GACGAGGTCG CCGAGGTCTT CGCGACGGTC
CAGAAACGCC CCGTGCGTGG CATCTACACC GGGGTGGGGC GTCGCGTCCC CTCCGACCGG
CCACCGGTGC AGACCCCGGC CGAGCTCGGC CTGCTGACGG CGGATGTCGC CGACCTGGTG
AACGACCCGG GCCACGGTAA TGGTGCCGCC GGACGCGGAC CGGGCGGCAA CGGAACCGGG
GGCAACAGGG CCGGCGGTCA CGGTCAGCCC GCCGGCGTGC CGGGGGCACC AGCGGGCTCG
GGCCCCGGGG TCGACCCGTC CGGTCCGGGC GGGGGCGTAC CGCACGGCAC GGATGTCGGT
GGTCCGGGTG GGAATGGCAG CCCCGGGGGG CACGGTTATC CCCAGGGCCA GGGCACGCCG
GCGCCCCCCG GAGGCGAGGG TCCGGCCCAT GACGGGCCGA GGATCGCCAA CCCGTGGGCC
CCGCCAAGCT GGCCGAGCGA CGACGAGAGG AGACGCCGTT GA
 
Protein sequence
MTPRRIFRGW VPLLLLVLFV IILTTGVLSG PSEYGKRDLN FVQQQIDEGQ VAKAKIQDSK 
QLIQIQTKDG QKFESSYVTE QGVVLANELR NKRVAYDVSV DRGNILVSLL LNLLPVLLVV
LLLFFFMNQM QGGGNRVMNF GKSKAKLVSK DTPKTTFADV AGADEAIEEL EEIKEFLENP
GKFQAIGAKI PKGVLLYGPP GTGKTLLARA VAGEAGVPFY SISGSDFVEM FVGVGASRVR
DLFEQAKANA PAIIFVDEID AVGRHRGAGL GGGHDEREQT LNQLLVEMDG FDVKGGVILI
AATNRPDILD PALLRPGRFD RQIVVDRPDL LGREAILRVH AKGKPIGPDA DMMVIARRTP
GFTGADLANV LNEAALLAAR SNLKFISSAL LEESIDRVMA GPERKTRAMS DKEKKRIAYH
EGGHALVAHA LPNSDPVHKV TILPRGRALG YTMQLPLEDK YLSTRSEMLD RLAVLLGGRT
AEELVFHDPT TGASDDIEKA TQISRAMITQ YGMSDKLGAI KFGTENSEVF LGKEVGHQRD
YSEEVASEID IEVRRLIEAA HDEAWEILVT YRDVLDNLVL RLMDTETLSK DEVAEVFATV
QKRPVRGIYT GVGRRVPSDR PPVQTPAELG LLTADVADLV NDPGHGNGAA GRGPGGNGTG
GNRAGGHGQP AGVPGAPAGS GPGVDPSGPG GGVPHGTDVG GPGGNGSPGG HGYPQGQGTP
APPGGEGPAH DGPRIANPWA PPSWPSDDER RRR