Gene Information |
Locus tag | Francci3_1385 |
Symbol | |
ID | 3903366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1664419 |
End bp | 1665471 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637878722 |
Product | hypothetical protein |
Protein accession | YP_480491 |
Protein GI | 86740091 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
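The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are inclusive and 1-based and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1664419, 1665471)
print(length, protein_length_aa(length))  # prints: 1053 350
```

Both values agree with the Gene Length and Protein Length fields listed in the record.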
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.173259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0076761 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGACC CCGCGGACCG GCTCCGGGCC GGCTCCGTGT CGGCGACCGA TCCGGCGGTG GAGCGGACCC TGCGCCGGCT CGAACTGACC GTGCTCCGCC GTCTCGACGG CATGCTCCTC GGCGATCATC TCGGGCTGCT GCCCGGTCAG GGCAGCGAGA AGGCCGAGAG CCGCGAGTAC GCCGTCGGTG ACGACGTGCG CCGGATGGAC TGGGCCGTCA CGGCCCGGAC CACCGTCCCC CACGTCCACG ATCTCATCGC CGACCGGGAG CTCGAGACCT GGGCGCTGGT GGACCTCACC GCCAGCACCG AGTTCGGCAC CGGTACCTGG CAGAAGCGTG ATCTGGCGAT TGCCGCCACG GCCGCGGTCG GCTTCCTGAC CGCCCGCACC GGCAACCGGA TGGGCGCGAT CGCGCTGACC CAGGCCGGTC CACGGGCGAT CCCGGCCCGG CCCGGCCGGG ACGGGCTGCG GGCGCTGCTG CACACGCTCA TCACCGCGCC TCGCGGCGCG CACGACCGGC CCCGCCGCCG GCCCGATCCG GCGGCGGCCA CCGACCTCGC CGCGGCGGTG GAGGGCCTGC TGCGCCCGCC GCGCCGCCGG GGGCTCGCGG TCGTCGTCTC CGACTTCCTG TCCACCGACC TGCGCTGGGA ACGGCCGCTG CGGGTGCTCG CGGCGCGGCA TCAGGTGCTC GCCATCGAGA TCGTTGACCC GCTCGAGCTG ACCCTGCCAC CGGTCGGGCC GCTGCCCGTC GTCGACGCCG AGACGGGGGC GTTCGTCGAG GTCCCGACGT CGTCGCGGCG GTTGCGGGAA CGTTACGCCG CGGCCGCCGT CGCCCACCGC AGCGCGGTCT CGCTGGCGTT GCGCCGGGTC GGCGCCGACC ATCTGGTCCT GCGCACCGAC TCGGACTGGC TGGTCGACAT CGTCCGGTTC GCCTCGGCGA ACCGGCGCGG CCGGGGAGCC GGCCGCCGCC CGCCCGTGGG TCGGACCGAC CAGGGTGGCA CGGCGGCGCC CGCCACCAGT CGGGCAGGCA CCAGTCGGGC AGGCAGCCGG TGA
|
Protein sequence | MTDPADRLRA GSVSATDPAV ERTLRRLELT VLRRLDGMLL GDHLGLLPGQ GSEKAESREY AVGDDVRRMD WAVTARTTVP HVHDLIADRE LETWALVDLT ASTEFGTGTW QKRDLAIAAT AAVGFLTART GNRMGAIALT QAGPRAIPAR PGRDGLRALL HTLITAPRGA HDRPRRRPDP AAATDLAAAV EGLLRPPRRR GLAVVVSDFL STDLRWERPL RVLAARHQVL AIEIVDPLEL TLPPVGPLPV VDAETGAFVE VPTSSRRLRE RYAAAAVAHR SAVSLALRRV GADHLVLRTD SDWLVDIVRF ASANRRGRGA GRRPPVGRTD QGGTAAPATS RAGTSRAGSR
|
| |
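The sequences above are printed in 10-character blocks. The following is a minimal sketch of how one might strip that spacing, translate the coding sequence, and compute a GC fraction. It assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG and ends with TGA), and it uses the standard amino acid assignments, which translation table 11 shares with table 1. The function names are hypothetical, and the test fragment is an excerpt built from the first 30 bases of the gene followed by its stop codon, not the full sequence.

```python
BASES = "TCAG"
# Amino acids in NCBI genetic-code order: first, second, and third base
# each cycle through T, C, A, G. '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpt only: first 30 bases of the gene sequence plus the stop codon.
fragment = "ATGACGGACC CCGCGGACCG GCTCCGGGCC TGA"
print(translate(fragment))  # prints: MTDPADRLRA
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.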