Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0394 |
Symbol | |
ID | 3903636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 467350 |
End bp | 468570 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637877723 |
Product | cupin 4 |
Protein accession | YP_479510 |
Protein GI | 86739110 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATGG ATCATCAGCT CATCCGCGAT ATAAAGACCG CGTTGGCATG GTCCAGACCC AGCTCGACAC CCTCCCGGTT CCTACGGGGG ACTCTTCCTG ATTCTGGGAT ATGCCCCAGG GTGCTCACGC CGACCAGATT TCTAGACCTG ATAATGAGAC GAAGCCTTGT CTCGCCCCAG ATGCGGTGCT TTCAAAACGA ATCAGAGCTG CATCCAAACT CCTTGCTTCA GATGAACACG ACGAGGCGAG GGCAGGTCAC TCCGATGGTC GATATGCGCC GGCTGGCCGG ACTTTTGCAA TCAGGCTGCA CCCTGGTGTT GGACGCGGTC AACCACTTTG ATCCGACGCT AGAGGTTGCG TGCCGAGCGT TTCAATGGTG GTTGCGCGCG CCTGTGCAGG CGAACGTGTA CCTCACAACG GGCGACGCGG CGGGTTTCTC CCTACACTGG GATGATCACG ACGTCATCGT CCTACAGTTA GCTGGCGACA AAGAGTGGGA GGTCCGGGGT CCGTCTCGCC GTGCTCCCAT GTACCGCGAC GCCGCACCCA ATACTGAGCC TCCCAAAGAC ATCGTGTGGT CTGGTACCGT GAATACCGGT GACGTGTTGT ATATTCCCCG GGGCCACTGG CATCGGGCGA GCCGCACCAG CAGAGGTGAC GGGTTTAGCC TTCATGCTAC CTTTGGGTTC ACGAGACGAA CGGGCGTCGA TTGGTTGGCT TGGCTTGCGG ACCAGTCGCG CCGCGAAGAG GTGTTTCGGG AGGATCTGAA TCAGCGGGGA GAAGACCCGA AAGAACATCA AAACGACGGC GAGAAAATTA TTGTTGCCGC ATCGCGTCTT CTTACGTCAC ATCCGCCGGC CCATTACTTG GAATCCGTGG CGCATGCCAC CTCCGCAGGC CGGTATGTCT CCACAGCCGG CATTTTTGGT CCGCCATCCG CGGTCGTGTG CGTTACTGAT TTTCCACCTC AGATAGAGAC CCAGGGCGAT ACAGTGGCGG TCGCGACGGC GGAGAAACGG ATCGTCTTCA CCAGGAAAGC ATTACCAGCC CTTGGGTTGC TTCTGTCGGG CAATCCTGTG TGCCTTGACT ACGTATCGTC CGCAGCGGGG ATCGATGGCG CGCGCCTTGG GGAGATACTT GTCCGGGAGG GCATATGCGC GGAACTGACT CCGGAATTAT TCTCGGGCTA TACCGGTCTG ACCACAGACG GCAAGCTTTA G
|
Protein sequence | MLMDHQLIRD IKTALAWSRP SSTPSRFLRG TLPDSGICPR VLTPTRFLDL IMRRSLVSPQ MRCFQNESEL HPNSLLQMNT TRRGQVTPMV DMRRLAGLLQ SGCTLVLDAV NHFDPTLEVA CRAFQWWLRA PVQANVYLTT GDAAGFSLHW DDHDVIVLQL AGDKEWEVRG PSRRAPMYRD AAPNTEPPKD IVWSGTVNTG DVLYIPRGHW HRASRTSRGD GFSLHATFGF TRRTGVDWLA WLADQSRREE VFREDLNQRG EDPKEHQNDG EKIIVAASRL LTSHPPAHYL ESVAHATSAG RYVSTAGIFG PPSAVVCVTD FPPQIETQGD TVAVATAEKR IVFTRKALPA LGLLLSGNPV CLDYVSSAAG IDGARLGEIL VREGICAELT PELFSGYTGL TTDGKL
|
| |