Gene Information |
Locus tag | Franean1_4117 |
Symbol | |
ID | 5672475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4900559 |
End bp | 4902229 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242993 |
Product | TAP domain-containing protein |
Protein accession | YP_001508410 |
Protein GI | 158315902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
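The coordinates and lengths listed above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are inferred from the listed values rather than stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(4900559, 4902229)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Both results match the Gene Length (1671 bp) and Protein Length (556 aa) fields.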
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAG ATGAGAAACG CACGCGGATG TGCGGTGTCG TTGTCGGCGC CGCGGCCCTC GTCGGGGTCG GCTCCTTCAC CGGGCCGATC CCCGGGGCGC TGGCGGCGGA CGGCGCGCCC GATCCCGCCC CGCCCGGTGT GGCGAGCCTG AACCCGGCCC TCGCCCCGTT CGAGAACCAG CCGGTGCGCT GGCACGAATG CCGGACCGGC CCCAACGACG CGATCGGCTC CTATCTGGAC GCGGCCGGCG CGCAGTGCGC CGGGATCAGC GTCCCGCTGG ACTACGCCCG CCCCGACGGC CGGGAGATCA CCCTCGGGGT GTCCCGGATC AGGGCAGCCG ACACCGCCCA CCGGCGCGGC ATTCTGATGA TCAATATTGG CGGTCCCGGC GGCCCTGCTC TGGACGCCAC CCCGGACCTG CGTGGGCTGC TCGGTGCGGC GGCGGACGGC TTCGACGTGG TCGGGATGGA CCCTCGTTTC GTCGGCCGCA GCGCACCGCT GGACTGCGGG CCGATCCTGA AGCGGCCCTG GCCGCGCGCC GGCGGCCCCG CCCAGGACAG TTTCGATCGC ACCGCCGACG GTCAGGCCGC GATAGCGCAG GCGTGCGCGG TGCACGCCGA TGTGCTGCCG TTCGCGTCGA CGCGGAACAC CGCTCGTGAC ATGGATGTTG TCCGTGCTGC GCTTGATGAG CGGAAGACGT CGTTTCTTGG CTTTTCCTAC GGCACCTACC TGGGTGCGGT GTACATGCAG ATGTTCCCTG ACCGGGTCGA TCGTTTCGTG CTCGACAGCG CGGTGGATCC GGCGACGTAC AACCCGCGGG TGCTGCGCGA CACCGGGGAT CTGCTCGAGG GCGCGCTGCG GGAGTGGGCC GGCTGGGCCG CCGGGCGGGA CGCCCAGTTC GGGCTTGGCA GCACGGCCGC GGAGGTGCTG GCCACCGTTG ACCGGATCTA CGCCGCGGCC GTGCGCGGTC CGCTGACCGT GGACGGCGTG GAGTACGACG CCGGGGACAT CCCGGGGCTG CTCATTGATG CACTCGTCGA CGACAGCGCT GAGGCGTCCG ACGTCCTGGC GATGGGTGTG CGGGCCTTCG CTGACGCGGC GGATGGCCGT GCCCCGGCGG AGAACCCCTA CCTCGACGAG TTCTTGGACG GTGTCGCTAC CGGCGGTCCG ACGCTGCCCG GCGGGTCACC CGGGCGCCGG GTGGAGGCGG TGCCGGCGGG GTCGATCAGC GCGTTCCAGA GCGCGCAGCT CGCGGTCCTG TGCGGGGATG TGCCGGCGTC CCGTGTGGCC GGTGAGTACC TGGCGGACAT CCGCCGTCAT CAGCGGGCGC AGCCGCATGT GGGTGCGGCG ATCTGGAACC TGACCCCGTG CACGTTCTGG CCGGTGCGCC CGGTGGAGGC ACCGACCCGG GTGGCGAATG CTGTGCCCGC GCTGGTGGTG GCCGCGGAGA AGGACAACCG CACGCCGTAC GCGGGCAGCC GGGCGCTGCA CCGGGCGTTG TCCTCGTCGC GACTGGTGAC GTTGCGCGGA GCACGGGTGC ACGGCGTGTA CGGCGTGCGC AGCGGCTGTG TCGACGACGC AGTCAATGCC TACCTGCGGT CGGGCACCCT GCCCAGCGCC GACCTCACCT GCACCCGTCC GCCGGCTCCC CCGGGTTCGC TTCCGGAGTG A
|
Protein sequence | MRRDEKRTRM CGVVVGAAAL VGVGSFTGPI PGALAADGAP DPAPPGVASL NPALAPFENQ PVRWHECRTG PNDAIGSYLD AAGAQCAGIS VPLDYARPDG REITLGVSRI RAADTAHRRG ILMINIGGPG GPALDATPDL RGLLGAAADG FDVVGMDPRF VGRSAPLDCG PILKRPWPRA GGPAQDSFDR TADGQAAIAQ ACAVHADVLP FASTRNTARD MDVVRAALDE RKTSFLGFSY GTYLGAVYMQ MFPDRVDRFV LDSAVDPATY NPRVLRDTGD LLEGALREWA GWAAGRDAQF GLGSTAAEVL ATVDRIYAAA VRGPLTVDGV EYDAGDIPGL LIDALVDDSA EASDVLAMGV RAFADAADGR APAENPYLDE FLDGVATGGP TLPGGSPGRR VEAVPAGSIS AFQSAQLAVL CGDVPASRVA GEYLADIRRH QRAQPHVGAA IWNLTPCTFW PVRPVEAPTR VANAVPALVV AAEKDNRTPY AGSRALHRAL SSSRLVTLRG ARVHGVYGVR SGCVDDAVNA YLRSGTLPSA DLTCTRPPAP PGSLPE
|
| |
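The gene sequence above is printed in blocks of 10 nucleotides, so the spaces have to be removed before the sequence is used. The following is a minimal sketch, not the database's own method, of stripping the spacing and translating the first and last blocks. It builds the standard codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which this sketch does not handle (this gene starts with ATG, so it makes no difference here). The names `normalize` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = normalize(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence above.
print(translate("ATGAGACGAG ATGAGAAACG CACGCGGATG"))  # MRRDEKRTRM
print(translate("CCGGGTTCGC TTCCGGAGTG A"))           # PGSLPE
```

The two outputs match the first ten and last six residues of the listed protein sequence. The last 21 nucleotides are in frame because the preceding 1650 bp are a multiple of 3.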