Gene Information |
Locus tag | Franean1_4210 |
Symbol | |
ID | 5672565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5014241 |
End bp | 5015182 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243083 |
Product | hypothetical protein |
Protein accession | YP_001508500 |
Protein GI | 158315992 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | |
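The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5014241, 5015182)
print(length_bp)                  # 942
print(protein_length(length_bp))  # 313
```

Both values agree with the Gene Length and Protein Length fields of this record.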
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.672369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.890959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTTCCG TGGACACGAC ACCGCTGACG CACGTCGAGG TCGACGGCCA CCCCGCGGGC GTGGACGACC TGCGCCACCT CGCCCTGGCC AACGACGGCC ACCTCACGAC GATGCAGGTC CGGGCCGGCC GGGTCCGGGG GCTGGCACTG CACCTGCGTC GGCTGGACCG CGCCAACCAG GAGCTCTACG GCACCTACCT GGACGGCGGC CTGGTCCGCG ACCGGATCCG GCACGCGCTG GCGACCACCC CGGCCCCGCA CGGCCCGGCG GAGAGCGGCG ACGCGACCGT GCGGACGGTG GTCTTTCCCA CCGGAACGGG GTCGGTCTCG ATCCTGGTCT CGATCGGCCC GCCGGCCGAG CCCGCCGCCG GCCCGCTGCG GCTGTGCTCG CTGCGCTACC AGCGGCCATT CCCGCACATC AAGCACGTCG GCAGCTTCGG GCAGATCCAC TTCGGCCGGC TGGCCCGCGG GCGCGGCTTC GACGACGCGC TGCTCGTCAC CGGGGAGGGC CTGGTCTGCG AGACCACCGT CGCCAACATC GGTTTCGTCA CCGCGGCGCC GGCCAGGGTG ATCTGGCCGG ACGGCCCGTC GCTGGTCGGC GTCACCATGG CGCTGCTCGA CGAGCGGCTG CGCCCCGGGG CCACCGCCGA GCCCGTCGCA CATCAAGTCC CCGCCGCGGC GACCGCGGCC GCGGTGACGG CGCCCGCCAC CGCCACCATC ACGCCCGCCG CCGCGGCCGG AGAGGAGCGC GGCCCGCTCC TCGAGTCCCG GCGGGAGCCG GTACGGCTGG CGGACGTCGG CCACTTCCGG GCCGCCTTCG TCGCCAACGC GCGGGGGATC GTGCCCGTCG ACCGGATCGA CACGACACCG GTCCCGGTGG ACGAGGCGGT GCTGGCCGCG CTGCGCCGCG CCTACGCCTC GGTGCCCTGG GACGAGATCT GA
|
Protein sequence | MGSVDTTPLT HVEVDGHPAG VDDLRHLALA NDGHLTTMQV RAGRVRGLAL HLRRLDRANQ ELYGTYLDGG LVRDRIRHAL ATTPAPHGPA ESGDATVRTV VFPTGTGSVS ILVSIGPPAE PAAGPLRLCS LRYQRPFPHI KHVGSFGQIH FGRLARGRGF DDALLVTGEG LVCETTVANI GFVTAAPARV IWPDGPSLVG VTMALLDERL RPGATAEPVA HQVPAAATAA AVTAPATATI TPAAAAGEER GPLLESRREP VRLADVGHFR AAFVANARGI VPVDRIDTTP VPVDEAVLAA LRRAYASVPW DEI
|
| |
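The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string could be normalized and its GC percentage computed is shown below, assuming the input contains only the letters A, C, G, T plus whitespace. The helper names are hypothetical, and the sample input is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used for block display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two display blocks of the gene sequence in this record.
fragment = clean_sequence("GTGGGTTCCG TGGACACGAC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 65.0
```

Applying the same two functions to the full Gene sequence field would give a value to compare with the GC content field, which the record reports rounded to a whole percent.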