Gene Francci3_1623 details

Gene Information

Locus tag: Francci3_1623
Symbol:
ID: 3905902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1950423
End bp: 1951550
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 69%
IMG OID: 637878961
Product: peptidase M4, thermolysin
Protein accession: YP_480728
Protein GI: 86740328
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
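
The start, end, gene length, and protein length values in this record agree with one another, and a little arithmetic shows how. The following is a minimal Python sketch, not part of the original record. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted as a residue; neither assumption is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1950423
end_bp = 1951550

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1128 375
```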


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCCTGCC CCCGCCACGC CCTGCCCTGC GCGGTCCCAC CCCACATCCT CGAACGCATT 
GTCCGCAACG GCACCGAGGA GCAGCGTTCC CGGGCCCTGT CCACCCTGCT CCAGGACACC
TCTCATCGCA CCATCCGGGT GCACAACGCC CTGATGCGCT CGACGCGGCG CGGTGCCGTC
CCACCGCGCC CGGCGCCCGA GACCGGCCCG CGGCGTACCG TCAGCGACGC GGGCGGCACC
GAGATCCTGC CGGGTGCGAT GGTTCGCGAG GAGGGTGGCG CCGCGATCGA CGACGCGGCG
GTCAACGAGG CCTACGACGG CCTGGGCTCG ACGTTCGCGT TCTACTCCGA CGTCCTAGGC
CGGAGCTCCA TCGATGACGA GGGGATGGCT CTGCTCGCCA CCGTGCACTA CGGCGACCAC
TACGAGAACG CCTTCTGGAA CGGCCGCCAG ATGGTGTTCG GCGACGGCGA CGGCGAGCTG
TTCAAGCGTT TCACGGCGTC GCTCGACATC ATCGGTCATG AACTGACCCA TGGCGTCACC
GAGGACGAGG CGGCGCTGAT GTATGTGAAC CAGTCCGGCG CGCTCAATGA GTCGATCAGC
GACGTCTTCG GTTCCCTGGT GAAGCAGTAC GTCCGCGGCC AGACCGCCGA GCAGGCCGAC
TGGCTGATCG GCGACGAGCT GCTCACAGAC GCCGTCCAAG GCGTCGCGCT GCGGTCGATG
AAGGCCCCCG GGACCGCCTA CGACGATCCG GTACTCGGCG ACGACATCCA GCCGGACCAC
ATGGACCGCT ACGTCCGGAT GACCGCCGAC AACGGCGGCG TCCACATCAA CTCGGGCATC
CCGAACAAGG CGTTCTACCT CGCCGCGACG GCTCTCGGCG GATACGCCTG GGAGAAGGCC
GGCCGCATCT GGTACGAGAC CCTGCGCGCA CCGCAGATCC GGCCGAACAC GACGTTCCGT
GCATTCGCCT CGGTGACCGT GCACCAGGCC GGCCTGCTGT TCGGTGCGGA CGCGCGGAAG
GCCGTCTCCG AGGCGTGGCA GGCCGTCGGC ATCGTCGTCC GCGCAGCCGG GCAGGTCAGC
GCAGCCGGGC AGGTCAGCGC AGCCGGGCAG GTCAGCGCAG CCGGGTAA
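
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal Python sketch under that assumption. The helper name `gc_percent` is hypothetical, and the call uses only the first block of the sequence to show the mechanics.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and newlines."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence, used only to demonstrate the call.
print(gc_percent("ATGCCCTGCC"))  # prints: 70.0
```

Passing the full gene sequence, spaces and line breaks included, to the same function gives a whole-gene value to compare with the reported GC content.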
 
Protein sequence
MPCPRHALPC AVPPHILERI VRNGTEEQRS RALSTLLQDT SHRTIRVHNA LMRSTRRGAV 
PPRPAPETGP RRTVSDAGGT EILPGAMVRE EGGAAIDDAA VNEAYDGLGS TFAFYSDVLG
RSSIDDEGMA LLATVHYGDH YENAFWNGRQ MVFGDGDGEL FKRFTASLDI IGHELTHGVT
EDEAALMYVN QSGALNESIS DVFGSLVKQY VRGQTAEQAD WLIGDELLTD AVQGVALRSM
KAPGTAYDDP VLGDDIQPDH MDRYVRMTAD NGGVHINSGI PNKAFYLAAT ALGGYAWEKA
GRIWYETLRA PQIRPNTTFR AFASVTVHQA GLLFGADARK AVSEAWQAVG IVVRAAGQVS
AAGQVSAAGQ VSAAG
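
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method. It relies on translation table 11 sharing the standard code's codon-to-amino-acid assignments (the two tables differ only in the start codons they permit), and the helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGCCCTGCC CCCGCCACGC CCTGCCCTGC"))  # prints: MPCPRHALPC
```

The output matches the first ten residues of the protein sequence above. The same call on the full gene sequence can be compared against the full protein sequence.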