Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1800 |
Symbol | |
ID | 5670202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2162137 |
End bp | 2163366 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240721 |
Product | aminotransferase class V |
Protein accession | YP_001506144 |
Protein GI | 158313636 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.345473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCAC GCGGCCCCGC AACGTCCGGC CAGCTGCTCC GGGACGGGCG TAGCGTCACT CTCGTGCCGG CCTATCTCGA CCACGCTTCG ACCACGCCGC TGCATCCGGT CGCCCGGGAG GCCCTGCTCG CCGCGCTCGA CGACGGCTGG GCCGACCCCG CACGGCTCTA CCGCGAGGGC CGCCGCGCGC GGATGCTCCT CGACGCCGCC CGGGAGACCG TCGCCGGCGT CCTCGGCGCC CGGACCAGCG AGATCAGCTT CACCGCGAGC GGGACGGCCG CGGCGCACCA GGCGCTGCTC GGCACCGCCG CCGCCCGCCG CCGCGCCGGG CGGGTGGTTG TGGTCAGTGC CGTGGAACAC TCCAGCGTCC TGCACGCCGC GCAGCGCCAC GAACGTGCCG GCGGCGAGGT CGTCACGATC GGTGTGGACG GCCTCGGCCG CGCCGACCCG GCCGCGTTCG AGGCCGCGCT CGACGCCCAC CCGGGAACGG CCGTGGCCGC CCTGCAGCAC GCCAACCACG AGGTGGGCAC CGTCCAGCCG GTCGCGGCGG TCGCGCGGGC GCTGCGCCGG CGCGGGGTGC CGCTGCTCAC CGACGCGGCG ACGACGGTCG GGCGGGTTCC CGTCGACCTC GCCGAGCTGG GCGCGGACCT GCTCACCGCG AGCGCACACA AGTTCGGCGG GCCGCCCGGG GTGGGCATCC TCGCCGTCCG CACGGGCACC AGGTGGGCCA ACCCGCTGCC GGCGGACGAG CGGGAGCACG GGCGGGTTCC CGGCTTCCCG AACGTTCCCG CCGTCGTCGC CACGGCTGCC GCGCTCGCCG TGCGCGCCAC CGAGATCGAC GCGGAGGCGC CCCGGCTCGC CGGCTACACC GAACGCCTCC GCCGGCGCCT GCCGGAGCTC GTCGAGGACG TCGAGCTGCT CGGCCCCGGC GGCGCCGACC CGGCGGTGGG ACTGCCGCAC ATCGTGGCCT TCTCATGCCT TTACGTCGCG GGCGAGGCAC TCCTGGACGA GCTCGACCGT GCCGGCATCG CCGTCAGCTC CGGGTCGAGC TGCACCTCGG ACACCCTGAC GCCCAGCCAC GTCCTGGTGG CGATGGGCGC GCTGACCCAC GGCAACCTCC GCGTGTCGTT CGGGCGGGAC TCCACCGACG CCGATCTGGA GGCGCTGCTC GACGCGCTGC CGCCCGCCGT GCGCGCCGTC CGCGAGCGCG CCGGGGCGGC AGGCCTGTGA
|
Protein sequence | MPPRGPATSG QLLRDGRSVT LVPAYLDHAS TTPLHPVARE ALLAALDDGW ADPARLYREG RRARMLLDAA RETVAGVLGA RTSEISFTAS GTAAAHQALL GTAAARRRAG RVVVVSAVEH SSVLHAAQRH ERAGGEVVTI GVDGLGRADP AAFEAALDAH PGTAVAALQH ANHEVGTVQP VAAVARALRR RGVPLLTDAA TTVGRVPVDL AELGADLLTA SAHKFGGPPG VGILAVRTGT RWANPLPADE REHGRVPGFP NVPAVVATAA ALAVRATEID AEAPRLAGYT ERLRRRLPEL VEDVELLGPG GADPAVGLPH IVAFSCLYVA GEALLDELDR AGIAVSSGSS CTSDTLTPSH VLVAMGALTH GNLRVSFGRD STDADLEALL DALPPAVRAV RERAGAAGL
|
| |