Gene Information |
Locus tag | Franean1_5722 |
Symbol | |
ID | 5674048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6945719 |
End bp | 6947059 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244575 |
Product | hypothetical protein |
Protein accession | YP_001509978 |
Protein GI | 158317470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | |
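The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(6945719, 6947059)
print(length)                  # 1341, matching the Gene Length field
print(protein_length(length))  # 446, matching the Protein Length field
```

Both results agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).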
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGG CGAGCTCGGC CCACCCGGAG AACCGCGAGC ACGTGTTCTA CTCCTGGGCG GCGCAGGAGA CGACGCACCC GCTCGTGGTC GCAGGGGCCG AGGGGAGCTG GTTCTGGGAC AAGGGTGGCA CACGGTACCT CGACTTCTCC TCCCAGCTGG TGAACGCCAA CATCGGCCAC CAGCATCCGG CGGTGGTCGA GGCGGTCGTG GCGCAGGCCC GGGAGCTGAC CACGGTCGGG CCGCAGCACG CCCACGCCGT GCGCTCGGAG GCGGCGCGGC TGATCGCCGA GCGTGCTCCC GGCGATCTGG ACCAGGTGTT CTTCACCACC GGGGGAGCGG AGGCCACGGA GAACGCCGTG CGGCTGGCCC GGCTCGCCAC CGGGCGGAAC AAGATCCTCG CCGCCTACCG GTCGTACCAC GGTTCGACCG GCGGCTCGAT CACGCTGACC GGGGAGCCGC GGCGTTGGGC GAGCGAACCG GGTATTCCCG GTGTGGTGCA CTTCTTCGGC CCCTATCCGT ACCGGTCGTC TTTCCACGCC GCCGACGCGG CTCAGGAGGG GGAGCGCGCG CTCGCGCACC TCGCAGAGGT GATCGAGCTG GAGGGGCCGT CGGCGATCGC CGCGGTCATC CTCGAGCCGG TCGTCGGTAC GAACGGGATC CTGGTGCCGC CGGACGGGTA CCTCGCGGGC GTCCGGGAGT TGTGCGACGC GCACGGAATA CTGCTGATCG CCGACGAGGT GATGTCGGGT TTCGGGCGGT GCGGGGAATG GTTCGCGATC GACCACTGGA ATGTGGTGCC CGATCTGATC TGTTTCGCGA AGGGCGTCAA CTCGGGTTAT GTGCCGCTTG GCGGGGTGAT TATTTCGCAG CGGATCGCGG ACCTGTTCGC GCGGCGGCCG TATCCTGGCG GGCTGACCTA TTCGGGGCAT CTGCTTGCCT GCGCGGCGGC GGTGGCGTCC ATCAGGGCTT TCGAGTCCGA GGACATTCTC GGCCGGGCCC GTGCGCTCGG TTCCGAGGTT ATCGGGCCGC AACTGGCTAA AATTGCCGCG CGGCATCCCA GCGTGGGTGA GGTGCGCGGG CTCGGGGTGT TCTGGGCGGT CGAGCTCGTC CGTGACCGGG TTACCCGGGA GCCGTTGGTT CCGTTCAACG CGGCGGGTGC CGCGGCCGCG CCGATGGCGG CCGTGACGGC GGCCTGCCGG GAACGCGGCC TCTGGCCGTT CACCCATTTC AACCGGGTGC ATGTCGTGCC GCCGTGCACC ACGAGTCCCG AGGATGTCTC CCTCGGCCTG TCGATCCTGG ACGAGGCCCT GGCCGTCGCG GACGGCTGCT ACACCGGGTA A
|
Protein sequence | MTQASSAHPE NREHVFYSWA AQETTHPLVV AGAEGSWFWD KGGTRYLDFS SQLVNANIGH QHPAVVEAVV AQARELTTVG PQHAHAVRSE AARLIAERAP GDLDQVFFTT GGAEATENAV RLARLATGRN KILAAYRSYH GSTGGSITLT GEPRRWASEP GIPGVVHFFG PYPYRSSFHA ADAAQEGERA LAHLAEVIEL EGPSAIAAVI LEPVVGTNGI LVPPDGYLAG VRELCDAHGI LLIADEVMSG FGRCGEWFAI DHWNVVPDLI CFAKGVNSGY VPLGGVIISQ RIADLFARRP YPGGLTYSGH LLACAAAVAS IRAFESEDIL GRARALGSEV IGPQLAKIAA RHPSVGEVRG LGVFWAVELV RDRVTREPLV PFNAAGAAAA PMAAVTAACR ERGLWPFTHF NRVHVVPPCT TSPEDVSLGL SILDEALAVA DGCYTG
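As a rough way to check the sequences above against each other, the following sketch computes GC content and translates a coding sequence. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence from this record.
print(translate("ATGACGCAGG CGAGCTCG"))  # MTQASS
```

Running `translate` on the full gene sequence lets the result be compared with the listed protein sequence, and `gc_content` on the same string can be compared with the GC content field.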