Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0285 |
Symbol | |
ID | 3903028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 330380 |
End bp | 331486 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637877614 |
Product | NLP/P60 |
Protein accession | YP_479401 |
Protein GI | 86739001 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGCGC AAAGCGTGCA GCTCGACGAA GAAACGCGAG GCGCCGGTCG CGGCCGTCAC CGCGCACCCT CCGCGCCCCC GGCCGCCCGC AGCCGGGCCC GGGCCCGCGC CTTCGCCGCC GTGACCACCG GGACCGTCGC GGTCTCCGGA GTGGCTCTCG CCGGCTGCGC CACCGACATC AACGCCGACG CCGCGAAGGA CGAGCAGCCG AACACCGCTC CGATCACCCT CGCCGCGCAG ATCGGCTCCC AGGCCGCCCT GGGGGGCAGC ATCGCCGCCG CGGCGGTCAG CACCCACGGC TCCACCTCCC TGCTCGGCTC GACGGGCGGT GTCGACGCCC CGCCCCTGGC CGGCAAGGTT CAGGTCGGTC TCCGGGTGAC CAACCCCGAC GTGAGCGTCA GCGCGGACCA GCCGATCGAC ATCGGCTTCT CGCTCTACAA CGAGCAGACC CACGAACCGC TGGCGAACCA GCTGGTCAAG GTGCAGGTGA AGCTCCCCAC CGGGTGGGCC ACCTTCAAGC ACCTTTACAC CAACGCCCAG GGCTACGCCT CCTACACGGC CCGGGTGCTC ACCACCACGA ATGTCACCGC GGTCTTCGAC GGCACGGACG CCCTGCAGTC CGCCCGCTCG GCGAACGACG CCACCCTGCG CGTACGGCCG GCGCCCACCC CCCGGCTCGT CCGCAACGCC GCCTGGTCGG ATCTCCTCAC GACAGGGGAC GCCGCGGACC AGGCGTCGGT TCCGGTCCCG TCGAGCTCCC TCGGGGAGAA GGCCGTCTAC CTGGCCTCCC TGCAGGCCGG CAAGCCCTAC GTCTACGGTG CGGAGGGCCC GAGCTCGTTC GACTGCTCCG GCCTCATCCA GTACATCTTC AAGCAGCTCG GCAGGAGCGT GCCGCGGACG ACCGACGCCC AGTTCGCCGC CGCCACCCGG GTGTCGCAGT ACAACAAACA GCCCGGCGAC CTGATCTTCT TCGGGACACC CGGGAACATT TACCACGTCG GCATCTACGC GGGCGACGGC ATGATGTGGG CGGCGCCGCA CACCGGCGAC GTCGTGTCGC TCAAGAAGAT CTATACCACC TCCTACTACG TCGGTCGGAT CCTCTAG
|
Protein sequence | MSAQSVQLDE ETRGAGRGRH RAPSAPPAAR SRARARAFAA VTTGTVAVSG VALAGCATDI NADAAKDEQP NTAPITLAAQ IGSQAALGGS IAAAAVSTHG STSLLGSTGG VDAPPLAGKV QVGLRVTNPD VSVSADQPID IGFSLYNEQT HEPLANQLVK VQVKLPTGWA TFKHLYTNAQ GYASYTARVL TTTNVTAVFD GTDALQSARS ANDATLRVRP APTPRLVRNA AWSDLLTTGD AADQASVPVP SSSLGEKAVY LASLQAGKPY VYGAEGPSSF DCSGLIQYIF KQLGRSVPRT TDAQFAAATR VSQYNKQPGD LIFFGTPGNI YHVGIYAGDG MMWAAPHTGD VVSLKKIYTT SYYVGRIL
|
| |