Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1436 |
Symbol | |
ID | 3903167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1729214 |
End bp | 1730254 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637878773 |
Product | hypothetical protein |
Protein accession | YP_480542 |
Protein GI | 86740142 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.14453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0780457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGACCT TTCCGTCAGC GACAGTGCGT GACTGTGTGG CCGTGCTTGA GCGTTTCTTC CCTCCGTCCT GGGCCGAGTC CTGGGACGCG GTGGGACTGG CGGTGGGTGA TCTGGACGCG CCGGCCTCGG CGGTCCTGTT CGCGGTGGAT CCGACCCCCG AGGTGGCGGC CGAGGCCGTC ACGGCGGGAG CTCAGCTGCT CGTCACCCAC CACCCGCTGT TCCTGCGGGG AGTGCACGGC GTCGCCGAGA CCGGTCCGGG CGGGCGGGTC GTTGCCACCC TGATCCGGGC CGGCGTCGCC CTGTACACCG TGCACACGAA CGCCGATGTC GCCGCGCCGG GCGTGAGTGA CGCGCTTGCC GACGCGCTGG GGTTGCGTGA CGTCGCCCCG CTGTCCGCCC CGGCCCCGCT GTCCGCCCCG GCGAACGTCG GGGCCGACGA GGGCGGTGCC GGGGCCGACG AGGGCGGTGC CGGGGCCGAC GAGGGCGGTG CCGGGGCCGA CGAGGGCGGT GCCGGGGCCG ACGAGGGCGG TGCCGGGGCC GACGAGGGCG GTGCCGGGAG CAGCGCCGGT GATGAGGGTG GTGTCGGTGA TGAGGGCGGT GTCGGCGGGG CGAGCGTGTG CTGGGGCCTC GGCCGGATCG GTGACCTGCC CGCGGCCGAG TCCCTGGAAC GGTTCTGCGC CCGGGTCGTC CGAGCGCTGC CGGCGACGGC GGGCGGGGTC CGGGCGACGG GTACCGCGGA CCGGCTCGTC CGGCGGATCG CGGTGTGCGG CGGCTCGGGG GGAGAACTGG TCGCCGCAGC CGAGGCCGCG GGAGCGGACG TCCTCGTCAC CGCGGACGGC CGGCATCACC ACGTCCTCGA CGCCGTCGGT GCGCATGGGG TCGTGGTGGT CGATGTCGCC CACTGGGCGA GTGAATGGCC CTGGCTGAGC CAGGCCGCCA CCCGCCTGCA CACCGGTCTG CGTGCGAGGG GACGTACGGT GAGCACGTCG GTATCCGCTA TCGTGACCGA TCCCTGGCGG GTGCACGTTC CGTCCCGGTA G
|
Protein sequence | MVTFPSATVR DCVAVLERFF PPSWAESWDA VGLAVGDLDA PASAVLFAVD PTPEVAAEAV TAGAQLLVTH HPLFLRGVHG VAETGPGGRV VATLIRAGVA LYTVHTNADV AAPGVSDALA DALGLRDVAP LSAPAPLSAP ANVGADEGGA GADEGGAGAD EGGAGADEGG AGADEGGAGA DEGGAGSSAG DEGGVGDEGG VGGASVCWGL GRIGDLPAAE SLERFCARVV RALPATAGGV RATGTADRLV RRIAVCGGSG GELVAAAEAA GADVLVTADG RHHHVLDAVG AHGVVVVDVA HWASEWPWLS QAATRLHTGL RARGRTVSTS VSAIVTDPWR VHVPSR
|
| |