Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4247 |
Symbol | |
ID | 5672602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5059027 |
End bp | 5061438 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243120 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001508537 |
Protein GI | 158316029 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTGGA GGGTAGCGGC CCTACGACGA CTGGAGCGGC TCCGGACTCT GCGGATCGTC TCACGGGGGC GTCCACCGCG GGCACCGGAG TCCAACCAGC CCCGCTTGGC CACGCCCCTC GCCCTGGTGG CCCTGGGAGC CCTCGTCGCC GCCTCCGGAT TCCGCGGTGC CTTCGAGCTG ACCGCGCGTG TCGCTGTGCC AGTCGGGCTT GCGGCTGTCC TGCCGCCGTT GATAGCCGCC GTCATGGCTG TGAGCACCCC GGCCCGGAAG GTTTGGCTGA CGGCCGCGAC GTCCACCGCC GCGTGGCTGG CCGTCCTCGT CCTGACCGTT TTCGGACCGT CCGGAGCATG GTTTCCGGCC GGTCTCGTAC CCGATGCGCT GGCTCATGGG TTGGACCGGT TGTTACAGAT CACACTGCCC GCACCGCCGC GAGCTGATCT GCTGGCGGTG GTGGTGACAC TGATCTGGCT CGCCGCCGCG TCCGCGTCCA TGCTCGTCGC CGCGGGCACC GGACGGGACA CGCTGACCCC GGTGGCCCCG ATCGCAATGC TTTTTGTCGC GGCGACCCTG ATCAGCCTCC CCGGCCCTGG TTCACACGTC TCGACCGCGT CCGTCCTGGT CGCGGTGTTC GCGCTCATCG CAGCCGTCAG CCGACCGTCG GCCGGGCGGT GGGACCGGGC CGCCGGACCG GCACCGGTGG CGGGTGCGCC GCCGGCGGCC GGCGCGTCGC CTGCGCAGCG GCGGGCCCAC GGAACTGGGC GGCGGCCGTC AACGAAGGCA GGAAGCCTGC GGCACCTGGT CGCGGTCCTC ACCATCACCT GCGTTGCGGC TGGCTCGTTC GGTCTGGCCT CCCTGGTCAC CTTGCTGGAC CCCGATCCGT TCGACGTCCG CGAGTACCGG TCGCCGCCGG CGCGGACCGT CCAGCAGGTT GACCTACTGG CCCTCGTTTC AGCCTGGCAG GCCGAGCCCC GGACCCGGCT GTTCACGCTG GAGGCGACCG ATGCGTTGAT CGAGGGCGCT CCGCCGGACC GAATGCGTCT CGCCGTCCTG GACCGCTACG ACGGCAGGAA CTGGAGCAGC GCGTCCCGCT ACATCCCGGC GGGCCTCGCT GTTCCAGCTC CGGAGGATGT GGATGGACCG GTAGCCAGAC ACACGCTGAC AATCGATCGT CTGAGCGGCC CCTACCTGCC GGTCCTTGGT TGGCCGACCC GGCTCGACGC GTCCGGGCTG CGCGCCGGGC GGCGCACCAG CGCGCCGCTC GCGGTCGACC TCGAGAACGG TCTCCTTTCC GCGGACGCGC GCCTGGAGCC GGGCCGGACG GTGGAGATCA CGTCGATACT CGGCGCCGCA CCGACCCTGC GTGAGGCCGG CGAACGGCCG GTCGACCCGA TTGCCGAACC CGTTCTCGCT CTGCCGGGTA ACGTGGCGCC GCCGGGCGAC CTGACGGCGA TCGCGCAGAA GGCATCGCGG TTGGCGGCCG TCCCAGGTAA ACGAGCTTCG GTACTTGCGG ATCTGCTGGC CCAGGGTCGG AAGCTTGATC GGACCGTGAT CTCCGGCAGC TCACTCGGCG ATGTCGAGGC GTTCCTCGGG GCGAGCCGCG TTGGTGGCGA CGCGCTGTTC GCGACCGCGT TCGCACTGGC AGCCAACACC ATCGGCCTCC CGACCCGGCT GGTGATCGGG TTCGACCGGC CCAGTTCGGC ATCTGACGGT GCTGTCCACG CCGGTGACGT CCGTGTCTGG CCGGAGGTGC GGTTCGCCGA GGTCGGCTGG GTTCCGGTGG ATCTCGGCTC GAGCTCGGCC ACCGGTACCG GCGAGACGCC GCTACCGTCA GCCGGGACTC CTGATACAGG CGTGCATTCG TCCACGCCGT CCACGCTCAG TCCGCCCGCG AATGGTGTGC GGGTGGCTCC ACCGCGTACC CCGCTTCCCC GGCCCGACCG CCCGGTGTGG GCCATGATTA TAGCGGTGAT TGTCGTCGTC CTCGGCCTTG CCGTTACCGG GGCGTGGTGG GCCACCAGGG CGGAACGACG GCGCCGACGC GCCCGTTCCG AGGCGTCGCC CCGCCGTCGT CTGATCGAGG CATGGTGGGA CGCGGTCGAG ACGATGGGTG GTCGGCGCCG TACGGTCCTG TCCTCCGACA CCTGCGCCGA GGTCGTGCGC GAGGCTCGGG AGGTCTACGG CGAGCGTGCC GCCGAGCCGC TCGCCGAACT CGGCACGGAC GCCGCGAGGG CGCTCTTTTC CGCGAGTGAT CCGCGACCTG CGGAGGCGGA CCGTGCGTGG GACCTGAACC AGCGCTTCCG TAAGCGCCTG CGGACCGAGC GCCGCCGTCG GCGCCGTGCC GCGGTGCGCG CAGCACCACG GCACCTCAGC CGGGCGCTGC GCAGGGCCGG CGCGCAGGCA CGTCGGCGAT GA
|
Protein sequence | MTWRVAALRR LERLRTLRIV SRGRPPRAPE SNQPRLATPL ALVALGALVA ASGFRGAFEL TARVAVPVGL AAVLPPLIAA VMAVSTPARK VWLTAATSTA AWLAVLVLTV FGPSGAWFPA GLVPDALAHG LDRLLQITLP APPRADLLAV VVTLIWLAAA SASMLVAAGT GRDTLTPVAP IAMLFVAATL ISLPGPGSHV STASVLVAVF ALIAAVSRPS AGRWDRAAGP APVAGAPPAA GASPAQRRAH GTGRRPSTKA GSLRHLVAVL TITCVAAGSF GLASLVTLLD PDPFDVREYR SPPARTVQQV DLLALVSAWQ AEPRTRLFTL EATDALIEGA PPDRMRLAVL DRYDGRNWSS ASRYIPAGLA VPAPEDVDGP VARHTLTIDR LSGPYLPVLG WPTRLDASGL RAGRRTSAPL AVDLENGLLS ADARLEPGRT VEITSILGAA PTLREAGERP VDPIAEPVLA LPGNVAPPGD LTAIAQKASR LAAVPGKRAS VLADLLAQGR KLDRTVISGS SLGDVEAFLG ASRVGGDALF ATAFALAANT IGLPTRLVIG FDRPSSASDG AVHAGDVRVW PEVRFAEVGW VPVDLGSSSA TGTGETPLPS AGTPDTGVHS STPSTLSPPA NGVRVAPPRT PLPRPDRPVW AMIIAVIVVV LGLAVTGAWW ATRAERRRRR ARSEASPRRR LIEAWWDAVE TMGGRRRTVL SSDTCAEVVR EAREVYGERA AEPLAELGTD AARALFSASD PRPAEADRAW DLNQRFRKRL RTERRRRRRA AVRAAPRHLS RALRRAGAQA RRR
|
| |