Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3675 |
Symbol | |
ID | 5672041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4350277 |
End bp | 4351503 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242558 |
Product | hypothetical protein |
Protein accession | YP_001507978 |
Protein GI | 158315470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.389336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAA CTCGTTCACT GTCGTGGTTG TCCGCCGTGC TGTTATGCAC CGCCGCCCTG ACCGGCTGTC AGGGCGGCGC GACGCAGAGC GCGATGTCCT GTACCACCGA GGGTGTCACC GCGGACGAGG TCAAGCTGGG GCTGTTGTTT CCGGACACCG GGCTCGGTGC GGTGACGTTC AGTGCCGCCC GGGCCGGAAT CGACGCCCGA TTCGGAGCGG TGAACGCGGC CGGCGGTGTG CACGGGCGCC AGATCGTCTA TGACTGGCGA GACGACACGG CCAAGGTGTC GATGAATCTC ACGATGGCCC GCACACTGAC GGAGCGGGAA AGCATCTTCG GCATGCTCGA GACGAGCACG GTGGCCTCGG GTAGCGCCGC CTATCTCGCG GAGCGCGGGA TTCCGGTGCT CGGGATCGCC GTGGAGGACG CGTGGGCGAA GTACCGGAAC ATGTTCTCCT TCAACTACAG TTTCACGGCG AAGGGCTCGG CCGACACGTT CGGCAAGTTC GTCCATGAGC GCGGCGGGAC CAAGGCGATG ATCCTGTACA ACCCGCTCGA CCCGACCGTG TCGACGCACA TCGCCGAACA GTTCACCTCC AGCTTCCGGT CGGTGGGAAT CACGACGACC TCGGTCGGCA CCGACGACAA CCCCGTCTCG GCCCAGGCGG ACCAGCTCGC GCGGCAGATG GCCGAGGAGG GCATCGACAC GCTGGCCGGC ACGCTGAGCA CGGAGGGCCT GGCCAAGGTC GTCGCGGCCG CGCGGCGCCA GGGCGTCCCG CTCAAGGTCA TCCTCAGCAG CAGCCCCGCG CCGAACGCCG AGCTGTTGGA GACCTACGGC TCGCAGCTGG CCGGTCTGAC GACGTTCGCC GCCTACATAC CGCTGGAGAC GAAGTCGCCC GCGCTCGACG CCTACCGCGC CGCGATGGCC ACCTACGCCC CGCAGCTGCA GGACACCGAC CAGACGCTGG CGCTGGTCGG CTACATCATC GCCGACATGT TCATCCGGGG CCTGGAGGAG GCGGGGGACT GCCCGACCCG GCAGGGCTAC ATCGACGGCC TCCGGGCGGT GAAGGGCTAC AACGCCGGTG GCCTCATCGG CGACATCGAC CTCGAACGTG ACTTCGGCAA GCCCGCCGAG TGCTACTCGT TCGTCGAGGT GAACCCCGAA GGCTCCGCCA TCGAGATCGT CAGCCCGAAC TACTGCGGAC ATCGGCTCGC CGACTGA
|
Protein sequence | MATTRSLSWL SAVLLCTAAL TGCQGGATQS AMSCTTEGVT ADEVKLGLLF PDTGLGAVTF SAARAGIDAR FGAVNAAGGV HGRQIVYDWR DDTAKVSMNL TMARTLTERE SIFGMLETST VASGSAAYLA ERGIPVLGIA VEDAWAKYRN MFSFNYSFTA KGSADTFGKF VHERGGTKAM ILYNPLDPTV STHIAEQFTS SFRSVGITTT SVGTDDNPVS AQADQLARQM AEEGIDTLAG TLSTEGLAKV VAAARRQGVP LKVILSSSPA PNAELLETYG SQLAGLTTFA AYIPLETKSP ALDAYRAAMA TYAPQLQDTD QTLALVGYII ADMFIRGLEE AGDCPTRQGY IDGLRAVKGY NAGGLIGDID LERDFGKPAE CYSFVEVNPE GSAIEIVSPN YCGHRLAD
|
| |