Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3801 |
Symbol | |
ID | 5672165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4509500 |
End bp | 4510954 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242680 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001508100 |
Protein GI | 158315592 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00993794 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.608011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCGG TTCTCGTCCT GCGGGACACG CGGTTGCAGG TCCGTCCCGG GGACACCGCC CGGACCAGCG CGACCGTGCG CAACGCCGGT GATCTCGTCG AGCAGTACGC CCTCGACGTG CTCGGACCGG CCGCCGCCTG GGCGGAGGTC ATCCCTCCCA CCATCTCGGT GGTCCGCCGC GGGGAGAGCA CCGTCCAGAT CCTGTTCCGG CCGCCGGTGG GGCCGACGAC CCCGGCGGGC ACCGTGCCGT TCGCGCTGCG CTGCGTCTCC CGGGAGAACC CGGACAGCGT CGCGATCGCC GAGGGGGACC TCGCCGTCGG GGCGATCCAC GAGATCGTCG CGTCGGTCAC CCCGGCGGTG TCCCGCGGCC GGTGGTCCGG CCGGTGGACC GCGCGCTTCG AGAACCGCGG CACAGCACCG GCCCGGCTGC GGCTGTCCGC CTCCGACGAA CGCCGGACGC TCGGTTTCGC GCTCGCCCCG GTGGAGCTGG AGATCCCCCC GGGCGAATCC GGTTACGCCT TCCTGAAGGC GCGAGCCGCC AAGCCGGCGC TGCTCGGTGC GCTGACCCGG CAGCAGGTGC GGCTGACCTA CACCCGGGAG ACCGCGCCCG ACGAGCCGGT CGCCGAGGGT TTCGTGGATG TCTCCTTCGA ACACGTCCCG GTGCTGTCCC GGGCGATGAC GACGATCGCC GGCCTCGCCC TCGTGGGCGG AGCCGCCGCC GTAGTCCTGC TGAGCCAGTC GAGCCCGAAG GACGACACGG CGGCGCCGGG GGCCGCACCA CCGGCACCGA CCACCTTCTC CGCGGAGACC GGTGACGGCG GTGTCGTCCG CCTGAGCTGG TCGACGGTCC CCGGGGCGAA GGAGTACGGA ATCCAGAAGC TGGTCGGGGA CGAGGACGTC GCGCTCGACA CCAAGAGGGT CGACGGGCAG CTGAACGCCT ACGACTTCAC GGGGCTCAAG GGCGGCGAGC GGACCTGCTT CCGACTGGTG GCGTTCAACG ACTCCGGGGC TTCGCAGCCG TCCCCGCACG CCTGCGCCAC CGCCGGGATC ACCCCGGAGC CGAGTCCCGG GCCCACGCCG ACCGGACCCA CCCCGACCGG ACCAACGCCC ACGTCCCCCA CACCCACGCC CGAGACGCCG AACCCGCAGC CCGGTCCCGG CACGCGGGAG CCGCGGGACG CCTACGTCGT CCTCAGTGTG TTCGCCAAGG ACGACCAGGT CGCGAACGAC GCGCAGAAAC CTGCACAACG TGCCGGCGAG ATCGGCTCGG CGCTCGGCGT CGACGTCGTC CTCGCCGACG CCGACAAGTC GACCCGGCTC TCCGCCCAGT ACCCGGGCTT CCTCGTGATC TACGCGGACC GCTTCGTCAC GCCCGAGGAC GCCGGGAAGT TCTGCACCGA CATGAGCGAC AAGCTCAGCA CCGTGAAGGC CATCTGCGTG GCGCAGAACA ACTGA
|
Protein sequence | MDPVLVLRDT RLQVRPGDTA RTSATVRNAG DLVEQYALDV LGPAAAWAEV IPPTISVVRR GESTVQILFR PPVGPTTPAG TVPFALRCVS RENPDSVAIA EGDLAVGAIH EIVASVTPAV SRGRWSGRWT ARFENRGTAP ARLRLSASDE RRTLGFALAP VELEIPPGES GYAFLKARAA KPALLGALTR QQVRLTYTRE TAPDEPVAEG FVDVSFEHVP VLSRAMTTIA GLALVGGAAA VVLLSQSSPK DDTAAPGAAP PAPTTFSAET GDGGVVRLSW STVPGAKEYG IQKLVGDEDV ALDTKRVDGQ LNAYDFTGLK GGERTCFRLV AFNDSGASQP SPHACATAGI TPEPSPGPTP TGPTPTGPTP TSPTPTPETP NPQPGPGTRE PRDAYVVLSV FAKDDQVAND AQKPAQRAGE IGSALGVDVV LADADKSTRL SAQYPGFLVI YADRFVTPED AGKFCTDMSD KLSTVKAICV AQNN
|
| |