Gene Francci3_3863 details


Gene Information

Locus tag: Francci3_3863
Symbol:
ID: 3906631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4625704
End bp: 4626708
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 74%
IMG OID: 637881189
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_482942
Protein GI: 86742542
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
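
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4625704, 4626708)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Under these assumptions the two values agree with the Gene Length (1005 bp) and Protein Length (334 aa) fields of the record.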


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.593328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.848378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGACCA GACGTGCCAG GGGAGACGGA CCCGACCCGA CGCGGCCCGG TGGTTCCTTG 
CCGGTCGAGC CGCTGGACGC CTACTCCGCG GTCGTGACCC GGGTGGCCGC GGCGCTGACC
CCGTCAGTGG CCAGCCTACG GGTGCGTACC CGGCGCGGCG CGGGAGCCGG CTCCGGGGTG
GTGTTCACCG ACGACGGGTT CCTGCTGACC TCCGCGCACG TCGTCGAGGG GCATCGGGCC
ATCTCCGGAG CATCGGTCGG GCTGGCGCAG TTCGCGGACG GCACCGAGCG CGAGGTCGAC
TTGGTGGGTG CCGATCCGCT CTCGGACCTG GCCGTGCTGC GGGCCCGGGG CACGACCCCG
CCGGCTGCCG TCCTCGGGGA CGCCGCGGGT CTGCGGGTCG GCCAGCTCGT CGTGGCCGTC
GGCAACCCGC TGGGGCTCAC CGGCAGCGTG ACCGCCGGGG TCGTCAGCGC CCTCGGCCGG
TCGTTGCCGA CCCGTTCGGG CTCGGCCGTG CGGGTGGTCG ACGAGGTGAT CCAGACCGAC
GCGGCTCTCA ACCCGGGTAA CTCCGGCGGG GCGCTGGTCA CGGCCGACGC CCGGGTGGTG
GGAGTGAACA CCGCCGTCGC CGGTGTCGGC CTGGGGCTGG CTGTCCCGGT GAACGACACG
ACGAGGAAGA TCCTCGCCGC GTTGATGCGC GACGGCCGGG TCCGCCGGGC GTACCTGGGG
GTCGCGGGGG CGGGTGTTCC GTTGCCACCC GCCGTGGCCG AGCGGATCGG GCAGCGCCAC
GGTGTCTGGC TGGCCGAGGT GGTCGTCGGC AGTCCGGCCG GGATCGCCGG GCTGTTCACC
GGGGATCTCG TCCTGTCGGT GGCCGGGACG CCCGTCGTCG CTCCGGGTGA TCTCCAGCGG
CTGCTGACCG AGGGCACGAT CGGCCGGCCA GTGGAACTCA CGGTGTGGCG GCGGGGCGCG
TTGGTGGACG TGATCGTCGT ACCTGCCGAG TTGGTGATCG CCTGA
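
The GC content field of the record can be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as plain text in space-separated blocks of ten bases, as shown on this page.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input only; paste the full gene sequence to compare with the record.
print(gc_content("GGCC AATT"))  # 50.0
```

Running it on the full 1005 bp sequence gives a value that can be compared with the reported GC content, allowing for rounding in the record.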
 
Protein sequence
MRTRRARGDG PDPTRPGGSL PVEPLDAYSA VVTRVAAALT PSVASLRVRT RRGAGAGSGV 
VFTDDGFLLT SAHVVEGHRA ISGASVGLAQ FADGTEREVD LVGADPLSDL AVLRARGTTP
PAAVLGDAAG LRVGQLVVAV GNPLGLTGSV TAGVVSALGR SLPTRSGSAV RVVDEVIQTD
AALNPGNSGG ALVTADARVV GVNTAVAGVG LGLAVPVNDT TRKILAALMR DGRVRRAYLG
VAGAGVPLPP AVAERIGQRH GVWLAEVVVG SPAGIAGLFT GDLVLSVAGT PVVAPGDLQR
LLTEGTIGRP VELTVWRRGA LVDVIVVPAE LVIA
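
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions: `translate` and `CODON_TABLE` are hypothetical names, and the codon assignments are those of the standard genetic code, which translation table 11 shares for sense and stop codons (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference is not handled here).

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence on this page.
print(translate("ATGCGGACCA GACGTGCCAG G"))  # MRTRRAR
```

The printed residues match the start of the protein sequence listed above; the last three codons of the gene (ATC GCC TGA) likewise give the final residues IA followed by the stop codon.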