Gene Information |
Locus tag | Francci3_3100 |
Symbol | |
ID | 3904226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3671691 |
End bp | 3673001 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637880421 |
Product | hypothetical protein |
Protein accession | YP_482186 |
Protein GI | 86741786 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
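The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3671691, 3673001)
print(length, protein_length_aa(length))  # prints "1311 436"
```

Both values match the Gene Length and Protein Length rows of the record.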
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0288555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.686964 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGAGC CGCTTCCGGC GCCGGTGCGT GCGCATGTCG TCGAGTTCGC CGAGCGGACG CTGGCGGATC TTCCCGAGTC GCAGGTCCCG CCGTCGCTGG TCGCGGTGCG CCGCTTCAAG CCCTCCCGCC GTGTCCGCCA GGGGGCCGTC CCGCTGGCCG CGGCGGTGGA CGGTGACGTG TTCCGCGGCC GGGTCGCCGA GTGGATCCAC CGTCATCATC CCGAGCTTGT CGAAGCCGTC AGCTCGCCGG ACGGACCGCC GCCCGCGGCG CCCCCCGAGA AGATCGCCGC GGTCGCGTAC CTGCTGCGGG TCTCCGGCTG GCAGGAGCTG GTCCAGGTCG CCGCGGCGTC CACGTCGGAG GCGGCGGCCC GCAGCCAGGT TGACGAGGCC GGGCGGACGA TACTGCGGCT CACCGAACAA CTCGAGACGA GCAAGCGCAT CGCCGCGGCC GAGCAGGAGG AACTGCGCGA GCAGCTGCAG GCGGCCCGGG CGGAGGCGGA CGAGGCTCGC CGGCGGCTGC GCTCGTCCGC TGCCGGGATC CGGCAGGCCG AGCAGGCCAC GCGGGAGGCC CTGACCGCGG CCGAGGCGGC CCGGAACGCC GCCCTGGCCG CCAGTCGGGA TGCCGAGGCG GAGACCCGGC GGCTGCGCGG CCGGGTGGCG GAGCTGGAGA GCGCGCTCGC CTCCACCCGC CGGGACAGCC GCGAGTCCCG CAGCGTCGAT GACGCCCGGT TGCGCGTTCT ACTTGACACC CTGATCGCCT CGGCGCACGG CCTGCGACGC GAACTCGACC TCCCGACGAT GGTCGCGAGG CCCGCGGATC TCATCGCTCG CGGCGGAGCC GGTCCGAGTA ACGGCCCGCA GGCGTTCGTC GGGGCCCGCG GGCGGCCGGA CGACGACCCC TCCCTGATCG ACGAGGTGTT GGCCGTCCCC GGGGTCCATC TGATCATCGA CGGGTACAAC GTGACCAAGC GGGGCTACGG CCGCCTCACC CTGCAGGCCC AGCGTGAGCG GCTGCTGTCC GGACTCGGTG CGCTCGCGGG CCGCAATCCC GACAGCGAGG TCACGGTCGT CTTCGACGCC ACCGCCGTCG TGGCCCGGCC GGTGGGTGTC GCCATGCCTC GGGGGGTGCG CGTGCTGTTC AGCCGGCCCG GGCAGCTCGC TGACGAGGAG ATCGTCCGGC TGGCGCGGAT GGAACCGGAA GGCCGCCCGG TCTTCGTCAT CACCTCCGAC CGGGAGGTCG CGGAGAACTG CGTCGCAGCC GGAGCCCGGG CGGTGCCCTC GGCCGCTCTG CTGGCCCGCC TCGACCGGTA G
|
Protein sequence | MREPLPAPVR AHVVEFAERT LADLPESQVP PSLVAVRRFK PSRRVRQGAV PLAAAVDGDV FRGRVAEWIH RHHPELVEAV SSPDGPPPAA PPEKIAAVAY LLRVSGWQEL VQVAAASTSE AAARSQVDEA GRTILRLTEQ LETSKRIAAA EQEELREQLQ AARAEADEAR RRLRSSAAGI RQAEQATREA LTAAEAARNA ALAASRDAEA ETRRLRGRVA ELESALASTR RDSRESRSVD DARLRVLLDT LIASAHGLRR ELDLPTMVAR PADLIARGGA GPSNGPQAFV GARGRPDDDP SLIDEVLAVP GVHLIIDGYN VTKRGYGRLT LQAQRERLLS GLGALAGRNP DSEVTVVFDA TAVVARPVGV AMPRGVRVLF SRPGQLADEE IVRLARMEPE GRPVFVITSD REVAENCVAA GARAVPSAAL LARLDR
|
| |
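The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The following is a minimal sketch of that step, with a GC-fraction calculation and a basic coding-sequence sanity check. The function names are hypothetical, and the sample string holds only the first 20 bases of the record, so its GC fraction will differ from the 75% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon.

    Simplification: translation table 11 also permits alternative start
    codons, but only ATG is tested here.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# First two blocks of the gene sequence from the record (illustration only).
sample = clean_sequence("ATGCGTGAGC CGCTTCCGGC")
print(len(sample), gc_fraction(sample))
```

To reproduce the reported GC content, pass the complete Gene sequence field to `clean_sequence` and then to `gc_fraction`.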