Gene Information |
Locus tag | Francci3_0423 |
Symbol | |
ID | 3903247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 503262 |
End bp | 504641 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637877754 |
Product | NLP/P60 |
Protein accession | YP_479539 |
Protein GI | 86739139 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
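The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, although both agree with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is not counted."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(503262, 504641)
print(length)                     # 1380
print(protein_length_aa(length))  # 459
```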
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCAGTG TCCCCACGGA GGGCTGGCAT AGCGCCGGCG AACCGCGCCC CGGGCCCCGC CGTCCCCGTG ATGTCCCGCC ATCCGGGTTC GGCCGGCGAA CTTCGAAACA CGTCGCAAGG CAGACTCCCG CCGCCGGCCC GACCGACGAT CCCCAGCCGA CCGGGAACGA TCCGACCGGA TGGCGTCCGA AAGCCGGAGC GCCCATTGTT CTCGGACTGG TCGGCATTCT CGTCGCCGGA CCTCCGGTTG TCTGGGGGTT TGCCGGATCC GCCTGGGCGG CGCCGACGAC TCCGGCCTCC GGGTCGGCCC CGGAGAGCCC GGAGAGCCTG CAGAGCCTGC AGGCGGAGAT CAACGGTACC CGGGTACGGC TGGACGAATC CACTCGTCAG ACGGCGATTG CCACTGAGGT GTTCAACGCC GAACGTATTC GGCTGGCCGA GGCCGAGCGG GCCGCGGCTG CCGCGGCCGG GCGGGTCGAC CGCGCTGACG ACGCTGTACG GCAGGCCTCG GACAAACACC GGGGGTTGGC GGTGTCGGCG AATCGGGCCG GGGGATTCGG GCAACTGTCG TTGCTGCTGA CCGGCGACCC CCGACAGGTA CTCGACCGGG CCGGTGCGGT CGATGCGCTG GCCCGCCGGC AACGCGTGGC GGACACTGGC CTGCGGCTCG CGCGTCGGGA TCTCACCGAG GCACGCCGGA GCGCTGACGT GGCACTTGCG GGCAAGAGGA AGATCGTCAT GCGGCTCGCC GCACGTAAGC GGTCCATCGA GGCGTCCGCC GCCGAGCAGC GCAGCCTGCT GCAGCGGCTC GAATCCCGCT ACGCATCGCT GGAGCGGCGG GCCAGGGAGC GCCAAGCCGC CGCCGCGCGG GCCCGCAGGG CGGCGGCGGC CGCCGCCGCG GCATCGGCGG CCAGGAAGGC CGCGGCCGAA CGGGTCCGTT ATCGGAAGGA GTCGGCCGCG GTCGCGGCCG CGGGCCGGGC GTTCGCCGCG GCCCCGACTA CCCCGGCACC CATCCCGCCG ACGGGCGGCG GTGGCGCGTC GCGCGCGGTG CAGGAGGCAT ACGCCCAGCT GGGCAAGCCC TACGTGTGGG CTGCGGCGGG GCCGAAGTCC TTCGACTGCT CCGGCCTGAC GCAGTGGGTC TGGGGGAAGG CCGGGGTCTC GCTGAGCCAC TACACCGGAT CACAATGGAA TGAGGGGCGC CGCGTGAACC GGGCGGGCCT CATTCCCGGC GATCTCGTCT TCTTCCATGC CGATCTTGAT CATGTCGGGA TCTACATCGG GGGCGGGAAG ATGATCCACG CTCCGCGGAC CGGGGAGGTG GTCAAGGTGG AGAAGATCTG GTGGTCGAGC TTCCGGGGGG GTGTGCGGCC GGGAGCGTGA
Protein sequence | MSSVPTEGWH SAGEPRPGPR RPRDVPPSGF GRRTSKHVAR QTPAAGPTDD PQPTGNDPTG WRPKAGAPIV LGLVGILVAG PPVVWGFAGS AWAAPTTPAS GSAPESPESL QSLQAEINGT RVRLDESTRQ TAIATEVFNA ERIRLAEAER AAAAAAGRVD RADDAVRQAS DKHRGLAVSA NRAGGFGQLS LLLTGDPRQV LDRAGAVDAL ARRQRVADTG LRLARRDLTE ARRSADVALA GKRKIVMRLA ARKRSIEASA AEQRSLLQRL ESRYASLERR ARERQAAAAR ARRAAAAAAA ASAARKAAAE RVRYRKESAA VAAAGRAFAA APTTPAPIPP TGGGGASRAV QEAYAQLGKP YVWAAAGPKS FDCSGLTQWV WGKAGVSLSH YTGSQWNEGR RVNRAGLIPG DLVFFHADLD HVGIYIGGGK MIHAPRTGEV VKVEKIWWSS FRGGVRPGA
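The sequences above are printed in space-separated blocks of ten, and the gene begins with GTG while the protein begins with M. The sketch below shows one way to strip the grouping, compute GC content, and translate with the amino-acid assignments of NCBI translation table 11, in which GTG can act as an initiator and is read as methionine. It is a minimal illustration, not the pipeline that produced this record. The helper names are hypothetical, and only the first three blocks of the gene sequence are used, so the GC value differs from the whole-gene 72%.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiator codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_table11(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
fragment = "GTGAGCAGTG TCCCCACGGA GGGCTGGCAT"
print(f"{gc_percent(fragment):.1f}")  # 66.7
print(translate_table11(fragment))    # MSSVPTEGWH
```

The translated fragment matches the first block of the protein sequence in the record.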