Gene Francci3_4494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4494 
Symbol 
ID3907470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5366291 
End bp5367436 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID637881826 
Productpeptidase M50 
Protein accessionYP_483569 
Protein GI86743169 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGCGA CCTTCGTGGT GGGACGGATC GCTGGAGTCC GGATCGGGGT CCACTGGAGC 
GTGCTGCTCA TCTTCGGCAT CATCGCGTTC GGCCTCGCGC AGGGCCGCCT CCCACAGGCC
TACCCGGGCC ATGCCCTGGT GGTGTACTGG GTGGCGGCTC TTGCCGCCGC AGTGGTTTTC
TTCGCCTCGC TGCTCGCCCA CGAGGTGGCG CACGCCGTGG TGGCCCAGCG CAACGGGGTG
GCCGTGGACG ACATCGTGTT GTGGCTGCTG GGCGGGGTGG CCCGGCTGAA GTCGGAGGCG
TCGAGCCCGG CAGCGGAGCT GCGGATCGCT GGTGTCGGCC CACTCGTCAG CCTCTTGCTG
GGCGGGCTCT TCGTGCTGGG CGCCTGGCTG CTCGCCCTGG CGTCCGCGCC CGAACTCCTG
ATCGAGGTGG TGGCCTGGCT GGCGGGCATC AACCTGCTGC TCGCCGTCTT CAACGCCTTT
CCCGCCGCTC CGCTCGACGG TGGGCGGCTG CTGCGCGCCT TCCTGTGGTG GCGTACGGGA
GATCGGCTGC GGGCGACCGC CGGGGCCACC GCGGCCGGAC GCGTCCTCGG CTGGCTGCTC
GTCGTTCTGG GACTCCTCGT GTTCATGAGA GGCGGCGGAT TCGGCTGGCT CTGGCTGGCC
CTGATCGGCT CGTTCCTCAT CGCGGCCGCC ACCGCCGAGG GACGGCAGGC GCAGTTGCGC
GGTGTGCTCG CCGGCGTCCC GGTACATGAC GCCATGACGA CGAAACCGCT CACGGTGCCC
GCGGCCCTGA CCGTCGCGGA CCTGCTGGCC GGCCCGCTGT ACCGGTACCG GCACTCGGCG
TTCCCGGTGA CCGGCGAGAA CGGAGCCCCG GTCGGGCTGG TGACCCTGGA CGGCGCCAAG
CAGGTGCCGC CGGAGAAGAG CGGCACGGTA ACGGTAAGCG AGGTGATGGT GCCACTGTCG
CGGACCACCA TCGCGGGTCC CGACGACCCG CTGGCGGATC TGCTGCCGCG CATGGAGCCC
GGCGCCGAGC ACCGCGTCCT GGTGATGGAT CAAGGCAGAC TCGTCGGGAT CCTGTCCCTG
TCGGACATCA GCCGCACGGT GACGTGGCTG ATGAACGCCG CCCCCGGGCC GCGCGAAGTC
CCGTGA
 
Protein sequence
MRATFVVGRI AGVRIGVHWS VLLIFGIIAF GLAQGRLPQA YPGHALVVYW VAALAAAVVF 
FASLLAHEVA HAVVAQRNGV AVDDIVLWLL GGVARLKSEA SSPAAELRIA GVGPLVSLLL
GGLFVLGAWL LALASAPELL IEVVAWLAGI NLLLAVFNAF PAAPLDGGRL LRAFLWWRTG
DRLRATAGAT AAGRVLGWLL VVLGLLVFMR GGGFGWLWLA LIGSFLIAAA TAEGRQAQLR
GVLAGVPVHD AMTTKPLTVP AALTVADLLA GPLYRYRHSA FPVTGENGAP VGLVTLDGAK
QVPPEKSGTV TVSEVMVPLS RTTIAGPDDP LADLLPRMEP GAEHRVLVMD QGRLVGILSL
SDISRTVTWL MNAAPGPREV P