Gene Information |
Locus tag | Francci3_4292 |
Symbol | |
ID | 3907260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5122032 |
End bp | 5123216 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881619 |
Product | colicin V production protein |
Protein accession | YP_483367 |
Protein GI | 86742967 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCAACG TCCTGGACAT CGTTCTTCTC CTGCTCGTCG TACTCTTCGC GATCTCCGGG TACCGGCAGG GCTTCCTGGT GGGCGCGCTG TCGTTCGTCG GCTTCCTCGG CGGTGGTGTG CTCGGTGCCA AGGTTGCGAA GCCCTTCGCC GAACTGATCG GCCAGGAGAG CCACGGCGCC ATGGTCGGTC TCCTGGCCGT GCTCGGTCTC GCCCTGGTCG GGCAGGTCGC GGGCACCGCG ATCGGGGTCG CCAGCCGGGA CCGGGTGACC TGGCGCCCCG GCAGGGCCGT CGATGCCGCC GCGGGTGCGG TGCTCTCCGG CTTATCGGTG CTTCTTGTCG CCTGGCTGCT GGCGACGGCC GTCGATCGAT CGCCCTTCGA GTCCCTCGCC CACTCGGTCC GGGAATCGCG GATCCTCGCC ACCGTGGACG CCGGGATGCC CAACGAGGTC CGCAGTGCAT TCGCCGACCT GCGCCAACTG GCTGACGACA ACGGCTTCCC CGAGGTCTTC GCCGGGCTCG GCGGCGGGCG CATCGTCGCT GCCGATCCGC CCGATCCCGC TTCCACCCGG ACTGCCGGGG TCCGCAACGC GGCGGCGAGC ATCGTGAAGA TCCGTGGCGT CGCCAGCTCC TGCGATAAAC GGGTCGAGGG ATCCGGGTTC GTCATCGCCC CGCAGCGCGT GATGACCAAC GCGCACGTCG TAGCCGGAGT GCACCGTCCG GTGGTCGTGC TGGCCTCGCG CACCCTGCCT GCCGAGGTCG TCCTGTTCGA CCCCGACCGG GACGTCGCCG TGCTCCGCGT GCCGGACCTG CAACGGCCGC CGCTGCGGTT CCGGACAAAC CCACCGGCCG CGGCCGAAGA CGCCGCAGTG ATCGCCGGTT ATCCCCAGGA CGGGCCATAC ACCACTGTGG CGGCGCGCGT CCGCAACCAT CAGACCGCCC AGGCCCCGGA CATCTACTCG CACGGCCTCG TCCTGCGCGA CATCTACGCG GTCCGTGGAC GGGTGTTGCC GGGAAACTCC GGTGGACCCC TGTTGTCCGA AAGCGGAGAC GTCCTGGGCG TCGTGTTCGC CGCGGCCGTC AACGACAGCG ACACCGGTTA CGCCCTGAGC GCCGCCGAAG TGGCCAGGCC TGCCGCCGCC GGGGTGATCG CCACGGCGGA GGTGAGCACT CAGGGGTGCG ACTGA
|
Protein sequence | MLNVLDIVLL LLVVLFAISG YRQGFLVGAL SFVGFLGGGV LGAKVAKPFA ELIGQESHGA MVGLLAVLGL ALVGQVAGTA IGVASRDRVT WRPGRAVDAA AGAVLSGLSV LLVAWLLATA VDRSPFESLA HSVRESRILA TVDAGMPNEV RSAFADLRQL ADDNGFPEVF AGLGGGRIVA ADPPDPASTR TAGVRNAAAS IVKIRGVASS CDKRVEGSGF VIAPQRVMTN AHVVAGVHRP VVVLASRTLP AEVVLFDPDR DVAVLRVPDL QRPPLRFRTN PPAAAEDAAV IAGYPQDGPY TTVAARVRNH QTAQAPDIYS HGLVLRDIYA VRGRVLPGNS GGPLLSESGD VLGVVFAAAV NDSDTGYALS AAEVARPAAA GVIATAEVST QGCD
|
| |
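The coordinate and length fields in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values, but the record itself does not state them. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 5122032
END_BP = 5123216
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the values listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the Gene Length and Protein Length fields listed in the record.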