Gene Information |
Locus tag | Franean1_7028 |
Symbol | |
ID | 5675339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8572164 |
End bp | 8573423 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641245874 |
Product | hypothetical protein |
Protein accession | YP_001511265 |
Protein GI | 158318757 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
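The coordinate and length fields above agree with one another, which a short check can confirm. The sketch below assumes 1-based, inclusive coordinates (an inference from the listed numbers, not something the record states) and that the final codon of the CDS is a stop codon. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 8572164
END_BP = 8573423


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length and Protein Length rows of the record.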
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.557944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAA CCTTAAAAAG AAGGCGGCCA TTCTTTATGG TCTCCGCCGT ACTTTGCGGC GCGCTAATTC TCGCTGGTTG TAGTCCAGTC GGCGAGCAAC CGTCCGTATC AACAGCCAGC CAGGCGTGCA ATACGCCCGG AATCACCGCA GGCGAGGTCC GCCTTGGGTT CTTGATGTCC GAGAGTGGCG CGGGAGCATC GATGGCCCAG CCGTTCAGGG CGGGCGTCGA CGCCCGCCTG GGCGTGGCGA ACGCGGCCGG AGGGGTCCAC GGACGCAAGG TTACCTACGT GTGGCGAGAC GACGAGTCGG CATCGGCTGT CAACCTTGCT TCGGCACGAC AGCTCCTCGC CACGGACGAC GTATTCGGAA TGATCGAGGC CAGCGCGGAG GCGTTCGGTT CATCAGCACT TCTACACAGC TCCGGGATCC CAGTCGTGGG TATCGCGCTG GATCCCACCT GGGCCTCCAA TGACAACATG TTCAGCTTTA CGAACATGAT GGCAAATAAT TCTTCCATCA GCACCTGGGG TGATTTTGTT GCAGCCCAGG GCGGCCGTCG CGCATTGATA TTCAAGCCCA TCTTCTCCGC GGCTTCAGAC ATCCTTGCGA TGAAAATGTC CGACAGTCTG CAGGCGGCTG GTGTAGCCGT TGTCGGCAAT AACGAGATCA GCCCGATGAC GCTCGTTCCC GCCGTCATCG GCGAGCAGAT CAGAGCCACA GCAGCCGACA CCCTGATATT TGCCACAGAC GCTGAGAACT CCTATCGGAT CGTGGCAGCG GCCCGGGCAG CCGGTGCGGC AATCAGGGTC GCTCTGGTTC CGCCGGACGG CTACGACCCC CGAGCGCTCC ACGAATGGGG AAGCGCCATC GCGGGCACGT ACTCCTATCT TCCGATCACA CCGTTTGAGG TGAGCACCCC CGTCTACCGC GGGTTCTTCA ACGCCATGGC CGCCTACTCG GCCCAGTTGC AGCCGCCGAA TCAAACCTAC GCGGCGGAGG GCTGGATCGC CGCCGACATG TTCCTGCGCG GGCTGGCCAT GGCGGGGGGC TGCCCGACCC GCGCAGAGTT CATCAGCAGC CTGCGGTCCG TTCAGGCCTA CGACGCCGAA GGACTACTGC CCGCGTCGTT GAACATCAGC ACGAGCGTTG GCGAGATCAT CCGCTGCCTT CACTTCGTGC AGGTCGCGCC CGACGGGACC CATTTCACGC AGGTAACCCC AACACCGTTG TGTGGCAGGC GGCTGGCCAC AAACGACTGA
|
Protein sequence | MMKTLKRRRP FFMVSAVLCG ALILAGCSPV GEQPSVSTAS QACNTPGITA GEVRLGFLMS ESGAGASMAQ PFRAGVDARL GVANAAGGVH GRKVTYVWRD DESASAVNLA SARQLLATDD VFGMIEASAE AFGSSALLHS SGIPVVGIAL DPTWASNDNM FSFTNMMANN SSISTWGDFV AAQGGRRALI FKPIFSAASD ILAMKMSDSL QAAGVAVVGN NEISPMTLVP AVIGEQIRAT AADTLIFATD AENSYRIVAA ARAAGAAIRV ALVPPDGYDP RALHEWGSAI AGTYSYLPIT PFEVSTPVYR GFFNAMAAYS AQLQPPNQTY AAEGWIAADM FLRGLAMAGG CPTRAEFISS LRSVQAYDAE GLLPASLNIS TSVGEIIRCL HFVQVAPDGT HFTQVTPTPL CGRRLATND
|
| |
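The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing its GC content. The following is a minimal sketch, not the database's own method: it builds the codon table from the NCBI TCAG-ordered amino acid string, whose codon assignments translation table 11 shares with the standard code, and it ignores the alternative start codons of table 11 because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Table 11 uses these same codon assignments; only its start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, copied from the record.
print(translate("ATGATGAAAA CCTTAAAAAG AAGGCGGCCA"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after `clean`) gives a consistency check on the record. `gc_percent` on the same input can be compared with the GC content row.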