Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2376 |
Symbol | |
ID | 3904583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2753686 |
End bp | 2754600 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637879706 |
Product | hypothetical protein |
Protein accession | YP_481472 |
Protein GI | 86741072 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.149878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCC AGCATCGCGA CAAGCCGAGC GGGGACAGTA CAGAACTACT GCGGACAGCC TGCGTCCACG CGGGTGTTCC GTACGGCTCG GCCGAACTCC TCCGGGAGGG CGAGAACACC ATCTTCCGTC TCCCGGGCGG CATTATCGCC CGCATAAGCC GCGCAGGCCA GGAGAAAGTC GCAGCCAAAG AGGTGACGGT CTCCCGCTGG CTTGAAGAGA ACGGTTTCCC TGCCGTTCGC GCGCTGGCGC TGCCAGGCGC CGCCCATCAG CCAGTTGTCA TCAGCGGGCA CGCCGTGACG TTCTGGCAGG AACTACCCCC GCACCGACAC GGGACACCAC GGGAGGTGGC AGCAGCATTG CGGGCACTTC ACGCCATTCC ACCACCGACT GGATTTACGC TCGATCCCCT GTCACCCATG ACACGGCTAC GCGAGCGCAT CGAGGGGGCA CGCACGCTAA CCGCCGCTGA CCGTCACTGG CTGCGAAACC GAGTCGAGAC TCTAACAAGT CGGTACAACA GCCTGCCGCC AGGGCTGCCG CTCAGCGCCG TTCACGGCGA CGCGTGGGGA GGGAACCTCG TGGTCACCCC TGACGGCCAA ACCGTACTCC TCGACCTCGA ACGCTTCTCT ATCGGCCCAC CTGAATGGGA CCTCGTATCG ACCGCCATCA AACACAGCTC CTTCGCCTGG ATCACAGCCG GAGAGTACGC TGAGTTCGTC GAGGTTTACG GCCACGACGT CACGAACTGG AGCGGCTTCG AAACCCTCCG GGACATTCGC GAGCTACGCA TGGCCTGTTA CGTAGCTCAG CAGGCTTCAA AAGACGATGG ATGGCAATCC GAGGCACGGA AACGCGTGGA TTCCATACGG GGACTCCTCG GTCCGCGACC GTGGCCATGG CAACCTGCGC TCTGA
|
Protein sequence | MSGQHRDKPS GDSTELLRTA CVHAGVPYGS AELLREGENT IFRLPGGIIA RISRAGQEKV AAKEVTVSRW LEENGFPAVR ALALPGAAHQ PVVISGHAVT FWQELPPHRH GTPREVAAAL RALHAIPPPT GFTLDPLSPM TRLRERIEGA RTLTAADRHW LRNRVETLTS RYNSLPPGLP LSAVHGDAWG GNLVVTPDGQ TVLLDLERFS IGPPEWDLVS TAIKHSSFAW ITAGEYAEFV EVYGHDVTNW SGFETLRDIR ELRMACYVAQ QASKDDGWQS EARKRVDSIR GLLGPRPWPW QPAL
|
| |