Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2662 |
Symbol | |
ID | 3904886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3142594 |
End bp | 3143664 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637879987 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_481753 |
Protein GI | 86741353 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0480971 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCGG AGCAGGGCCC GGTCGTGCTG GGCATCGAGA CGTCCTGCGA CGAGACCGGG GTGGGGCTCG TCCGGAACGG CACGCTGCTG GGCGAGGCCC TGTCGACGAG CATGGACCAG CACGCCCGCT ACGGCGGGGT GGTCCCCGAG ATCGCCGCCC GGGCCCACGT GCAGGCGCTG GTGCCCTGCG TGCGCGCGGC GCTGTCCTCG GCGGGGCTGT TCGTGGCGGA CATCGGCGCC GTCGCGGTCA CCGCCGGCCC CGGCCTCGCC ACCGCCCTGC ACGTCGGGGT GGCCGCGGCG AAGGCGTACG CCACGGCGCT CGATGTTCCC CTCTACGGCG TGCACCATCT CGCCGGTCAC CTCGCCGCGG ACCTCGTCGA CGGCGAACCG CTACCCGATC CCCTCATCGC CCTGATCGTC TCCGGCGGGC ACACGTCGCT GCTGCGGGTG GGGGACCTCG CTCGCGACCC GATCACCCAC CTCGGCGACA CGCTTGACGA CGCGGCCGGG GAGTGCTTCG ACAAGGTCGC CCGGGTGCTC GGCCTGCCCT ATCCGGGCGG TCCCGCGGTC GACCGAGCCG CGGTCGGCCA CGATGCGACG GCGCTGGCCT TCCCCCGGCC GCTGACCGGC CGGGCGGACG CGCCCTACAC CTTCTCGTTC TCGGGGCTGA AGACCGCCGT CGCCCGATGG GTCGAGTCGC ATCCCGACTC CCCCGTACCG GCCGGCGATG TGATCGCATC CTTCCAGGAG GCAGTCGTCG ACGTGCTCAC CGCCAAGGCG GTCCGTGCCT GCCTCGACCA CGGGATCGGT GACCTGCTCA TCGTCGGCGG GGTCGCGGCG AACAGCCGGC TGCGGGCGCT GGCGGCCAGC CGCTGCGAGC AGACCGGCAT CCGGCTGCGG ATACCGGCCC GCCGGCGGTG CACGGACAAC GGCGTGATGA TCGCGGCGTT GGGTGACCTG CTCGTCCGCG CCGGCGCCGA GCCCTCCCCC GCCGAGCTCA CCGCCATGCC GGGCGCGTTC CTCGAACGGG CCCAGCTCGG CACCGCGCTG CCGGCCCTGC ACGCCGCGTG A
|
Protein sequence | MPPEQGPVVL GIETSCDETG VGLVRNGTLL GEALSTSMDQ HARYGGVVPE IAARAHVQAL VPCVRAALSS AGLFVADIGA VAVTAGPGLA TALHVGVAAA KAYATALDVP LYGVHHLAGH LAADLVDGEP LPDPLIALIV SGGHTSLLRV GDLARDPITH LGDTLDDAAG ECFDKVARVL GLPYPGGPAV DRAAVGHDAT ALAFPRPLTG RADAPYTFSF SGLKTAVARW VESHPDSPVP AGDVIASFQE AVVDVLTAKA VRACLDHGIG DLLIVGGVAA NSRLRALAAS RCEQTGIRLR IPARRRCTDN GVMIAALGDL LVRAGAEPSP AELTAMPGAF LERAQLGTAL PALHAA
|
| |