Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4152 |
Symbol | |
ID | 3907117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4951639 |
End bp | 4952730 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637881480 |
Product | hypothetical protein |
Protein accession | YP_483229 |
Protein GI | 86742829 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAA CAAGCACGTC ACCCCGAGTT GCCGGGACGA CGCGGCGACG GCGAACCCGC CGCGTCCGCG GTTTTCGCGC GATCGGTGCT GCGGTGGTGG CGCTGGCCCT CCCGCTGGTC AGCGCCGGAC CCTCGTCGGC CGGCACCGGC GGGTCGGTGA TCGACCTCGG TACCCTGCCC GGCGGCAGCA GCAGCGGCGC CTACGACGTG AACAACAGCG CGGTCGTCGT CGGCTACTCG GCGAGTGCCA GCGGCCGCAA CCACGCCGTC CGGTGGAACA GCCTGGGTGT ACCGACCGAC CTCGGCACCC TGCCCGGGGA CGAGGCGAGC GTCGCCTTCA CCATCAACGA CGCTGGCACC GTGACGGTTG GCGCCTCCCT GATTCCCGGT GGCCCCACCC ATCCGGTGGA ATGGGACGCG GCTGGCCAGA TCACCGCGCT GACGACGCCT GCCGGGAGCA TTCTGAGCCG GGCCTATGCG GTCAACAACC AGGGCACGGT CATCGGCTTC TGGAGTGGGC CGGACCGGTT GTACCACGCG CTGCGGTGGA CCTCGGCCAG CACGCCCGTG GCCCTGCCCC AGCTACCGGG GGATACCGCC AGCTCCGCGG GATGGATCAA CAACAGGGGT GTGATCGTCG GCTATTCGAA GACTGCAGCC GGCGTTGCGC GGGCCGTCCG GTGGAACCCT GACGGAACCG TCTCCAGACT CGCCGACCTG CCGGGCAGCG ACTCCAGTGA GGCGAGCGCC GTCAGCGACA CCGGCATCAT CGTCGGTCTC GCCACCACGG GTGCGAGGTC GCACGCGGTC CGCTGGGACC ACGCCGGTGG GATCACCGAG CTGCCGCCGC TGCCCGGCGA CACCGGTGCC GGTGCCTATG GCGTCAACGA GCGGGGGATC GTCATCGGCT TCTTAAGCGC GAGCGACGGC ACCCGCAGCG CGGTGGCGTG GAGTCCGAGC GGCCAGGTCA CCCGGCTGCC GACCGCCACC GCCGGGCCGG CGGAGGCGTA CGGCGTGAAC GACCGAGGCG CGGTCGCGGG GTCCTCCACC GCGGCCGACG GATCGGCTCA CGCAACACTC TGGCTGGCCT GA
|
Protein sequence | MTGTSTSPRV AGTTRRRRTR RVRGFRAIGA AVVALALPLV SAGPSSAGTG GSVIDLGTLP GGSSSGAYDV NNSAVVVGYS ASASGRNHAV RWNSLGVPTD LGTLPGDEAS VAFTINDAGT VTVGASLIPG GPTHPVEWDA AGQITALTTP AGSILSRAYA VNNQGTVIGF WSGPDRLYHA LRWTSASTPV ALPQLPGDTA SSAGWINNRG VIVGYSKTAA GVARAVRWNP DGTVSRLADL PGSDSSEASA VSDTGIIVGL ATTGARSHAV RWDHAGGITE LPPLPGDTGA GAYGVNERGI VIGFLSASDG TRSAVAWSPS GQVTRLPTAT AGPAEAYGVN DRGAVAGSST AADGSAHATL WLA
|
| |