Gene Francci3_4320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4320 
Symbol 
ID3907289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5161420 
End bp5162955 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content71% 
IMG OID637881648 
Producthypothetical protein 
Protein accessionYP_483395 
Protein GI86742995 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGC CGGCCGGACC GCAACCAGCC TCAGCCGACG GAGGAAGCAC GGTGCGCGAG 
ACCGCCCATG AGATCGTCGA ACGCGCCCTG GGCCTGTCGA AGGCGGACGG CTGCGTAGTG
ATCGCCACCG AGTCCAGCGC GGTGAACCTG CGCTGGGCCA ACAACACGTT GACGACCAAC
GGCGCCAGCC GGGATCGGTC GATCACCGTC ATCAGCGTCA TCGGCCGCTC GTTCGGGGTG
CGGACGGCCT CCACCATCGA CTCCCCCGGT TCCGGCGGCC ACCCCACCGC GCGTCCCACC
GACCTGCCGC ATCCCACCGA TGCCGACGGC CTCGCGGAAC TCGTCCGCGC CGCCGAGGAC
GCGGCCCGGG ACGCCGAGGA TGCCGAGGAC TACACGGACC TGCTCGGGCT GGACACGACG
GACGTCGGCG CGGGTGCGGG GTCGGCGGGT GCGGGGTCGG CGGGTGACGC GGCCGAGGCC
TTCACCGATC CGGCCGCCCG GACCAGCACC ACGGTGTTCG CCTCCTTCGC CCGGGACCTC
GCGGAGGCGT TCGCCGCGGC GCGGGCGGGC GGGCGGCGGT TGTTCGGCTT CGCCGAGCAC
AATCTGACCA CCACGTGGCT CGGGACGTCG ACGGGGCTGC GGCTGCGGTA CAGCCAGCCG
ACCGGCAGCG TGGAGTGGAA CGCCAAGAGC GGCGTTCCCG GCGGCTCGGT CTGGCACGGC
CAGTCGACCC GTGACTTCAC CGACGTCGAC GTCGCGGGTA CCGACGCGGC GCTGCGCGAC
CGGCTGGCCT GGTGCGAACG GTCGCTGGAA CTGCCGGCCG GCCGGTACGA GACGCTGCTG
CCGCCCTCCG CGGTGGCGGA TTTGATGATC TACATGTACT GGACGGCTGC CGGTCGGGAC
GCGGCCGAGG GCCGGACGGT CTTCAGCCGG GCCGGCGGCG GCACGCGTCT CGGCGAGGCG
ATCGGACCGG CGGGACTGCG GCTGGCCAGC GATCCGCACG ACCCGGAACT CGCGACGACC
ACCTTCGTGA CCGCGCAGTC CTCCTCGTCG ATGTCCAGCG TCTTCGACAA CGGACTGGCG
CTGAGGCCCA CCGACTGGAT CTCGGATGGC ACCCTCGCCG CCCTCGTGGA GACCCGTGCG
TCCGCCCGTG CCACCGGGGT CCCGACCACC CCGATGATCG ACAATCTGAT CCTGGACGGC
GGAGGCTCCG CGTCGTTGCA GGAGATGATC GCCTCGACGA AACGCGGGCT CCTGCTCACC
AGCCTGTGGT ATATCCGCGA AGTCGATCCC GAGGTGCTCC TGCTCACCGG CCTCACCCGG
GACGGGGTCT ACCTGGTGGA GAACGGTGAG GTCACCGGGG CCGTCAACAA CTTCCGCTTC
AACGAGTCAC CGGTCGACCT GCTCGGCCGC CTGGCCGAGA TCGGCGCCAC CACCCGCACC
ATGGCGCGGG AATGGGCGGA CTGGTTCACG CTCACCCGCA TGCCCGCGGT ACGGATCCCC
GACTTCAACA TGTCCTCGGT CAGCCCGGCG AACTGA
 
Protein sequence
MSRPAGPQPA SADGGSTVRE TAHEIVERAL GLSKADGCVV IATESSAVNL RWANNTLTTN 
GASRDRSITV ISVIGRSFGV RTASTIDSPG SGGHPTARPT DLPHPTDADG LAELVRAAED
AARDAEDAED YTDLLGLDTT DVGAGAGSAG AGSAGDAAEA FTDPAARTST TVFASFARDL
AEAFAAARAG GRRLFGFAEH NLTTTWLGTS TGLRLRYSQP TGSVEWNAKS GVPGGSVWHG
QSTRDFTDVD VAGTDAALRD RLAWCERSLE LPAGRYETLL PPSAVADLMI YMYWTAAGRD
AAEGRTVFSR AGGGTRLGEA IGPAGLRLAS DPHDPELATT TFVTAQSSSS MSSVFDNGLA
LRPTDWISDG TLAALVETRA SARATGVPTT PMIDNLILDG GGSASLQEMI ASTKRGLLLT
SLWYIREVDP EVLLLTGLTR DGVYLVENGE VTGAVNNFRF NESPVDLLGR LAEIGATTRT
MAREWADWFT LTRMPAVRIP DFNMSSVSPA N