Gene Information |
Locus tag | Franean1_0519 |
Symbol | |
ID | 5668938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 604272 |
End bp | 605528 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641239448 |
Product | hypothetical protein |
Protein accession | YP_001504886 |
Protein GI | 158312378 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
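The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(604272, 605528)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Under these assumptions the computed values agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields.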
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTCG GTAGATTGTC GATACGATGG ATAGTGTCGG TTGCAATTGT CGTTGTTGTG GCTGGCTGTA CTTTGCCGGG CAGTGGGAAA GAGCGACCAG TGGCGGTTGG CTGTGACAGC CCGGGAGTCA CCTCCGACCA GGTAAAACTG GGGTTGGTGT TCTCTGACTC GGGTATTGGT AGCACTGCAC TTTCCTCGGC CCGGTCCGGG GTGGATGCTC GGATCAACCT GGCCAACGCT CAAGGTGGCA TCCACGGTCG TAGGATTTTT TACCAGTGGC GCGATGACGC GAGTTCCTCG TCACAGAACG CGCTGGCCAC TCAGGATCTT GTGCAACAAG AATCCGTGTT CGGTCTCGTG GCAGCTACCG CCTCGCTCGA AGGTTCACTG GCTCGTCTCG ACGCGCAGGG CATCCCCGTG GTAGGTATCG CGCTTCCGTC TTGGAACAGG TATCGTAACC TTTTTTCCCA CCTCTACATG CCCTCTCCGG GAACGGTCGC CCGCTATATC CAGGCCCACG GTGGGACGAG AATCGCTGTT GTTACCACCG GCACTGTAGC CTTAACCATG GAGACCATCA CTCAGTACAA AAATGCCTTC AGCGCTCTCG GACTCGCCGC GACCGATCCC ATCCCGTACA CGAGCAGTAG CGACAGCCCG CAGCGCATCG TCCACCAGCT CGCGGCTATC CACGCCGATG CCCTGATCGG CTTCACCGCG CCAGAGGATC TCGCTGACAT CGCGCAGGCA GCTCGTGCCG CGAACCTACG CCTGAACGTC GACGTTTCCC TGACCGGCTA TGACAAAGCC CTTCTTCCCG CGTTCGGCCA AGCGCTAGCT GGTGTGTCTA TCCCCGTGTA TTTTCGTCCG TTCGAGGCGG GAGGCCCAGC CATCGACCGC TACCGCGATG CAATGACACT CTACGCACCT GAAAGCATCG AACCCGACCA GCAGTTCGCC ATGCTCGCCT ACATATACAC CGACCTGTTC CTACACGGAC TTGACCTAGC CGGCACCTGC CCAACCCGTG AAGGGTTCAT CAAGGCCCTA CGGGGTGTCA CCGATTATGA TGCGGGCGGT CTGATCTCGC CCGTCGACCT GAGTGCCAAC TCTACCCGCC CCCTCGATTG TTTCGCGTTC GTCCGTGTCA ACTCCACCGG CACTGCGTTC GACGTCGTAC ATCAACGACT CTGCTCCGAC GGCTCGGAAT CCCTGCCGCC GGGAAATGAA TCCACGCCAA CCGGCCGTAG TCGGTAA
|
Protein sequence | MIFGRLSIRW IVSVAIVVVV AGCTLPGSGK ERPVAVGCDS PGVTSDQVKL GLVFSDSGIG STALSSARSG VDARINLANA QGGIHGRRIF YQWRDDASSS SQNALATQDL VQQESVFGLV AATASLEGSL ARLDAQGIPV VGIALPSWNR YRNLFSHLYM PSPGTVARYI QAHGGTRIAV VTTGTVALTM ETITQYKNAF SALGLAATDP IPYTSSSDSP QRIVHQLAAI HADALIGFTA PEDLADIAQA ARAANLRLNV DVSLTGYDKA LLPAFGQALA GVSIPVYFRP FEAGGPAIDR YRDAMTLYAP ESIEPDQQFA MLAYIYTDLF LHGLDLAGTC PTREGFIKAL RGVTDYDAGG LISPVDLSAN STRPLDCFAF VRVNSTGTAF DVVHQRLCSD GSESLPPGNE STPTGRSR
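The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in the coding orientation even though the gene lies on the minus strand; the sketch below assumes this and does not reverse-complement. Translation table 11 assigns the same residue to each codon as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATTTTCG GTAGATTGTC GATACGATGG"))  # MIFGRLSIRW
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.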