Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6824 |
Symbol | |
ID | 5675137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8315369 |
End bp | 8316574 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245673 |
Product | hypothetical protein |
Protein accession | YP_001511064 |
Protein GI | 158318556 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.759789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAGA CTGGAGTGGT CGGACCGTTA TCCCTGGTAG CCGCGATCGT CGTGGGTGCC CTGACAGGGT GCTCGGCATC GGGAGCAGGA GCCGGGGCGA ACGGCCCCTG CGACAGTCCC GGAGTCACGG CCGACCAGGT CAAGTTCGGC TTCGTCTTCT CCGACACGGG CACGGGCAGC GAGGCGCTCT CGTCGGCCCG TCTGGGAGTT GACGCCAGGA TCGGGCTGGC CAACGAGACG GGAGGGGTCA ACGGCCGCCG CGTCACCTAC GACTGGCGGG ACGACGCGGC CTCCTCGTCC ACGAACGTGC GGGTGACCCA GGATCTCAGC AGTTCCACCT TCGGCCTGGT GGGAGTGACC TCCGCTGTCG GTGACTCCCT CGACAACCTC GAGAAGGAGG GAGTTCCATA CGTCGGTCTC GTCCAGCCCT CCTACGCCAA GTACCCGAAT GTCTTCGCGC ACCTGTACGA GGCGGCGCCG GAGACGATCG GCCGCTACTT CCAGGCCAAC GGCGGGACGA AGGTCGCCAT GGTGAGCACC GGGGCGTCAG CGTTCACGCA GGAGGTCGCC GGGCGGTACC GCAGCGCGTT CGAGGCCGTC GGCCTGCAGG TGGCCGCGCT GATTCCCTTC GCGGCCAGCG TCGACAGCCC GGCGCGGGTG GCCCAGCAGA TCGCCGGCAG CGGCGCGGAC GTGCTCATGG GCTTCACCAC CGTGGACGAC CTGGCCGCCA TCGTGCGGGC CACCCGCCAG GCGAACCTGC GCCTCGCCGC GAGCGTCTCG ACCAGCGGGT ACGACCGCGG CGTGCTGACC TCGCTGGGGT CGTCGCTCAG CGGGGTCTCC TTCCCGGTCT ATTTCCGCCC CTTCGAGGCG GGCGGGCCGG CCATCGACCG CTACCGCGAC GCGATGACCC GTTTCGCGCC GCAGGCCGTC CAGCCCGAAG AGAAGTTCGC TGTGTACGGA TACATCTATG CGGACATGTT CCTGCGCGGA CTTGAGCTGG CCGGCGACTG CCCGACCCGC GAGGGCTTCA TCAGCGCGCT GCGGAAAGTG ACCGACTACG ACGCCGGCGG GCTCATCGAG CCGACCGACC TGCGCACCAA CGCGACCACC CCGCTCCAGT GCGCCGCGTT CGTCCAGGTC AATCCGGCCG GTGACGCGTT CCAGGTGGTG CGCGAGCGAC TCTGCGCCAA CGGCCAGGGG AACTGA
|
Protein sequence | MRKTGVVGPL SLVAAIVVGA LTGCSASGAG AGANGPCDSP GVTADQVKFG FVFSDTGTGS EALSSARLGV DARIGLANET GGVNGRRVTY DWRDDAASSS TNVRVTQDLS SSTFGLVGVT SAVGDSLDNL EKEGVPYVGL VQPSYAKYPN VFAHLYEAAP ETIGRYFQAN GGTKVAMVST GASAFTQEVA GRYRSAFEAV GLQVAALIPF AASVDSPARV AQQIAGSGAD VLMGFTTVDD LAAIVRATRQ ANLRLAASVS TSGYDRGVLT SLGSSLSGVS FPVYFRPFEA GGPAIDRYRD AMTRFAPQAV QPEEKFAVYG YIYADMFLRG LELAGDCPTR EGFISALRKV TDYDAGGLIE PTDLRTNATT PLQCAAFVQV NPAGDAFQVV RERLCANGQG N
|
| |