Gene Francci3_2633 details

Gene Information

Locus tag: Francci3_2633
Symbol:
ID: 3906306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3107056
End bp: 3108273
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 73%
IMG OID: 637879958
Product: peptidase M50
Protein accession: YP_481724
Protein GI: 86741324
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
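
The coordinates and lengths listed above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3107056, 3108273)
print(length)                  # 1218
print(protein_length(length))  # 405
```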


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.962625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.303399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATC AGCACGGCGG CGCGGGTTCG GGCGGCGCGG GTTCGAGCGG CGCGGGTTCG 
AGCGCCCACG GCGGCGCGGG CCCGGCGGGG CAGGCTCCGA GTGAGCGACC CCCCGGCGTG
CCGGTCGGGC GGATCCGCGG TGTTCCCATC GTCATCTCGC CGTTCGCCCT CGTCTTCGCC
GTCCTCGTCG CCTACCTGCT CTCCGGCTCG ATCCGCGACC GGCTGCCGCT GGCGTCCGAT
GGGCGGATCC TCGCGCTCTC CTCGTTGATC TCTATCGGCT TCCTGGCCTC CCTGCTCGCC
CACGAGATCG GTCACGCTCT GACGGCGCTG GCGTTCGGTC ACACCGTGCG GTCCGTGACC
CTGCACGGCT TCGCCGGGTT CACCGAGTTC GAGCCGGAGC CCCGCAGCGC CGGCCGCGAG
TTCCTGATCG CCTTCGTCGG CCCGGCGGTC AACGGGGTGC TGGCCGCCGG CTGTCACCTC
GGCCTGCTCG GCCTCGACGA CACCAGCGAC GCCGCCGCGG TCCTGCACGA TCTCGGACTC
ATCAACGCCG CGTTGTTCCT CTTCAACCTG GCACCGGGCC TCCCGCTGGA CGGTGGACGG
GTGGTCGTCG CCGCGGTGTG GGGTCTGACG CGCGACAAGC TGCGGGGGCT GCGGGCCGGT
GCCTACGGCG GGTTCGTCGT CGCCGCCGGC CTGGTCGTCT GGGGTGCGTC GACCTCCGAC
GGCATCGGCA TGGTGTACAC CTATGCTCTG GCGGGCTTCC TCGCGTTCGC GGCCTACCAG
TCGCTGCGCG CCGCGCAGGT GCGGGAGCGG CTGCCCGGCC TGTGCGCCGG TCGCCTTGCG
CGCCGGACGC TGCCCGTCGA GGGTGCCGTT CCGCTGGCGG AGGCGCTGCG GCGAGCTCAG
GAGGTCGGCG CCACCGCCGT CGCGGTGATC GACCGTGACG GCAGCCCCCT GAAGATTATG
AATGGCTCCG CGGTCGACGC GCTGCCGGAG CATCGGAGAC CCTGGATGAC CGTGGATGAA
GTGAGTCGGG TGATCTCGCC CGGCATGGTC CTCGACGCCG ATCTGGAAGG CGAGGCCCTG
CTGGCGGCCG TGCAGCGGGT GCCGGCGTCG GAGTACCTCG TCAAGCAGGC GGGCCGCCCG
GTCGGCGTGC TCGCGATGGT GGATCTCGTC GCCCGTATCG ATCCGGCCGC CGCCGCCCGC
ATGGTGGCGT CCCGGTGA
 
Protein sequence
MADQHGGAGS GGAGSSGAGS SAHGGAGPAG QAPSERPPGV PVGRIRGVPI VISPFALVFA 
VLVAYLLSGS IRDRLPLASD GRILALSSLI SIGFLASLLA HEIGHALTAL AFGHTVRSVT
LHGFAGFTEF EPEPRSAGRE FLIAFVGPAV NGVLAAGCHL GLLGLDDTSD AAAVLHDLGL
INAALFLFNL APGLPLDGGR VVVAAVWGLT RDKLRGLRAG AYGGFVVAAG LVVWGASTSD
GIGMVYTYAL AGFLAFAAYQ SLRAAQVRER LPGLCAGRLA RRTLPVEGAV PLAEALRRAQ
EVGATAVAVI DRDGSPLKIM NGSAVDALPE HRRPWMTVDE VSRVISPGMV LDADLEGEAL
LAAVQRVPAS EYLVKQAGRP VGVLAMVDLV ARIDPAAAAR MVASR
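
The gene and protein sequences above can be cross-checked with a few lines of code. The sketch below computes GC content and translates a nucleotide string into protein. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names are illustrative, and the whitespace stripping assumes the 10-base block layout used on this page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, order T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above.
fragment = "ATGGCGGATC AGCACGGCGG C"
print(translate(fragment))  # MADQHGG
```

Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content and protein sequence listed in this record.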