Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3866 |
Symbol | |
ID | 5672229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4595342 |
End bp | 4596736 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242744 |
Product | hypothetical protein |
Protein accession | YP_001508164 |
Protein GI | 158315656 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.679852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACCCC GTAGAACTGT CGCGGCCGTC GCTGCCCTCG CAGCGGTGAC CGTCTTCGTC GCAGGATGTG GCCGCGACTC GGGCGGCGAA TTAGGGAGTG AGGACACTCC CGCCACCCAG GGGTCCACGG CCGCCGCAGC GGGTGACTTC GGCGACCTGA AGGACGTCTG TGGCACCGGC GACCCGAAGG GCGCGCCCGC TCAGGGCGTG ACCGCGAGCG AGATCAGCGT CGGTGTCTTC AGTGACGTCG GCTTCACCAA GAACTCTGAG TTCGACGACG CCGCCAAGGT GTTCACCTCC TGGTGTAACG AGGCCGGCGG CATCAACGGT CGCAAGATCG CCTACAACCT GCGGGACTCG AAGATCTTCG AGACCCGCCA GCGGATGATC GAGTCCTGCC GCGAGGACTT CGCGATCGTC GGCGGTGGCA GCGCGCTCGA CGCCGGCGGC GTGGAAGAGC GCCTCAAGTG CCTGCTCCCC GACATCCCCG CGCAGACGAG CCAGCCGGAG AACATCGGCT CGGACCTGCA GATCGACGCG ATCGGCGCCG GGCACTCCTA CATCCGCTAC GCCGGTTACT TCAACTGGCT GCTGAAGGAG GCCTACCCGG CCTCCGCCGG TGCGGTCGGC ATCATCGCCG GTGACTCCCC GGTGACCAAG GTCATCGGGG ACCAGACGGT GGAGGCCGTG CAGAAGGCCG GCGGGACGGT CGCCTACAAC GACCTCTACC CGGCGGCCGG CGTCTCGGAC TGGACGCCCT ACGCCCAGGC GCTCAAGAGC AAGGGCGTGA AGGGCCTGGT CTTCCAGGGC GACTTCCGCA GCCTCGCGAA GCTCGAGCAG GTGCTGTCGT CGATCGACTA CAAGCTCGAC TGGATCGACG CCAACAGCAA CGCCTACGGA TCCGCGTTCG TGGAGCTCGC CGGCGACGCC ATCAGCACCC AGAACAACCT GGCCGACCTC AGCGGGGTCG CGCCTCTCGA GGTGGCGGAC GAGATCCCCG CCGTCCAGAA GGTCCTGGAC CTCTACAAGG AGTACGCGCC CGACGCGGAG GTCACCTTCC CGGCACTGCG CGCCTTCTCG TCCTGGCTGC TGTTCGCGGA GTCGGCCAAG GAGTGCGGGG ACGACCTCAC CCGCAAGTGC CTCTACGACA CGGCGCGCGA GCAGACCAAG TGGACTGCCG GTGGCCTGCA GGCCTCGGTC GACATCACCA AGGCCGACGC GCCGCTGAAG TGCTTCAACG TCGTGCAGGC GAGCGCGGAC GGCTGGAAGC CCGCGGACTT CGAGCCTGAC ACCGGTGTGT TCCGCTGCGA TGCCCCTTCC GTCAAGTACA CGGGCTCGTA CGGCACGCCG CTCACCCTCG CCAGCGTCGG CAAGAGCCTG AGCGACCTCA AGTAA
|
Protein sequence | MRPRRTVAAV AALAAVTVFV AGCGRDSGGE LGSEDTPATQ GSTAAAAGDF GDLKDVCGTG DPKGAPAQGV TASEISVGVF SDVGFTKNSE FDDAAKVFTS WCNEAGGING RKIAYNLRDS KIFETRQRMI ESCREDFAIV GGGSALDAGG VEERLKCLLP DIPAQTSQPE NIGSDLQIDA IGAGHSYIRY AGYFNWLLKE AYPASAGAVG IIAGDSPVTK VIGDQTVEAV QKAGGTVAYN DLYPAAGVSD WTPYAQALKS KGVKGLVFQG DFRSLAKLEQ VLSSIDYKLD WIDANSNAYG SAFVELAGDA ISTQNNLADL SGVAPLEVAD EIPAVQKVLD LYKEYAPDAE VTFPALRAFS SWLLFAESAK ECGDDLTRKC LYDTAREQTK WTAGGLQASV DITKADAPLK CFNVVQASAD GWKPADFEPD TGVFRCDAPS VKYTGSYGTP LTLASVGKSL SDLK
|
| |