Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0373 |
Symbol | |
ID | 3903424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 438654 |
End bp | 439643 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637877702 |
Product | hypothetical protein |
Protein accession | YP_479489 |
Protein GI | 86739089 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.413913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCAT CGACGGACCG CCGAACGATC CTTACCACGA GCGCGCTACT CGGTGTCCTC GGAGCCGCCG AGGCGGTCGA CATGACATCC TCGGCGATGG ACAGCCGCGC AGCGCTACAC CAGGTCGAGG CCGGTCTGGA GTCCCTCGCC CGCCGCTGGG AGACCACCGC GCCGGCCGAC TTGGTCGAGC GCCTCGCGAC GCTGGAGCAC CATGCGCGGA CGATCGGCCG ATGGCGGTTG GGCTCAGCCC TGCGCCGGGA CTGGATACGA GCGCATGGCC GGATTCTTCT CATGACCTCC GTGGCACAGG GTGACAACGG CATGGCCAGG CCGGCCACAG TCTCGGCGCG GGCGGCGCTC GTCCTCGCGA AACATGTAGG AGACGCGACG ACCGCCGCCC ACGCCGGAGT CGTGTTGGCT GAACTCGCCG CTTATTCCTT CCAGAGCACC CGCGACGGGT TGGCGCTGGC GCGGGCGGCC CAGGCCACGG CACCACACGC GCACACCGCG GTCCTGGCGA TGACCACCGA GGCGCACATC ATGGCCGCGT GCCACCTGCC TACCGACGAT ATCGTCGCCG TGCTGCGCTC AGCGGGATCG GTCGCGGCAA AGCTCCCGGC AGGTGCGGTG GGTTACAGCC TGGACGGTAT CCATCCTGGG TATCTGCCGA CATTCGGCGG TGCGGCACTC GTCGCCGCCG GCGCATTGGA CGAGGGAAAT GAGCGGCTGA GCGAGGCAGC CGAACTGTTC GACCGTTCCC GGGCGTCCGG TGCTCTCGCC GCCGTCCGCC TTTACCAGAC ATCCGCGGCG ATGCGCGCAC GCGAGCTGGA CAAGGCCGAG ATCCTCGCGA CCAGAGCGCT CGCCGCATCC GCCGTGCGGC CGAGCAGCTG GCTGTCGAAC GGAATCCTCT TTCTCGCCGA ACGCGCCCGC AGCCAGGGCG CGGACTGGTC CGGACTCGTC ATCCAGGCCC AGGAGTGGTC CCCGCTCTGA
|
Protein sequence | MDASTDRRTI LTTSALLGVL GAAEAVDMTS SAMDSRAALH QVEAGLESLA RRWETTAPAD LVERLATLEH HARTIGRWRL GSALRRDWIR AHGRILLMTS VAQGDNGMAR PATVSARAAL VLAKHVGDAT TAAHAGVVLA ELAAYSFQST RDGLALARAA QATAPHAHTA VLAMTTEAHI MAACHLPTDD IVAVLRSAGS VAAKLPAGAV GYSLDGIHPG YLPTFGGAAL VAAGALDEGN ERLSEAAELF DRSRASGALA AVRLYQTSAA MRARELDKAE ILATRALAAS AVRPSSWLSN GILFLAERAR SQGADWSGLV IQAQEWSPL
|
| |