Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2626 |
Symbol | |
ID | 5671020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3107022 |
End bp | 3108581 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241542 |
Product | hypothetical protein |
Protein accession | YP_001506962 |
Protein GI | 158314454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGGT CCAGTCTGCG GGCGGAGCAG GACGAGTTGC GGGCGCGGAT GCGTGCGGTC GGTATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGTCCCCGC GCCGCCCACC GCGTCGCGCA CGGCTGGACC CAGCAACAGG CCGCAAACCA CATCAACGCC CACGCCGCCC GCACCGGCCT CGACCCCCGC GGCACCGCCC CCATGACTGC CCCCCGGCTG TCGGAGCTGG AGAACTGGCC GCTACCGAAC AACCGCCGCC GGCCCACCCC CCAGCTCCTC GCCCAACTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT CGACGACCGC GAACACCTCA CCCCCGCCGA CACACTCCTG ATCAACACCA CACGCCGAGA CGCTCGATCG ACGCCGCCGG CAGGGTCCCC AGTGGCACTG TCACCACCCG CGGTCCCCCG CTCCAGACCT CCAGCCGGTC CGGGCTTCGA GTCGGAATAC GGCGGCGCGC CCAAGAGACT GCGTCTCGAT GCGTCTGCTG GGAACATCGA AGGGGTGGAC GCTCTCGGCC GCCGCGGGTT CACCCTCCTC GCCGGATCCG CACTCATGGC GGGCCTGGCG GGTAACGGCC GTGCCCGCCG CGTCGACCCG GCGCTCGTCT CCTATTTCGA CGGCCAGCTG AAAGGCCACT ACCACGCGGA CATGCTGCTC GGCTCCGGCG CGCTGATCGG CACAGTCGCC TCCCAATTCG AGGTCATCGC GCAGCTGGTG GACACAGCGG ACGGGTCGAC CCGCCAGCGC ATGGCGAAGG TCGGCTCGTC GTTCGCAGCG TTCGCGGCCT GGCTGTGGCT GGACGCCGGC GATCCGGTCG CCGCGATGCG CTGTCACGAC GCCGCCTTGG AGCTGGCGCA CCGCTGCGGG GAACGCGACG CCGTCGCCTG CGCGCTGGTC GACCGGGCGA TGGCCTTCAC CGACCTGGAA AACGCAGCGG CCGTGATCGA CCTGTGCCAG GCCGCGCTGG TCGATGCCCA GCACCTCTCG CCCGAGGTTC AGGTGTTCGC CTTGCAGCAG CAGGCGCACG GTGCCTCGCT GCGCGGTGAC CATCGCCAGG TCGATCTCCT GCTCGATCAG GCCGGCCGAC TCGTGGACCA GGTCGACGTC GAGGAGTGGG GCACGGCCTG TCGCCGTACC AACGGCTACG TCGAGGTGCA GCGTGCCACC TGCTACGGAC GGCTCGGACT GGCTGACGAT GCCGACCGTC TCTGGCAGCA GATCATCCCC GCCGCACATC CCTCAGCCCG CCGCGACGTC GGGGTCTGGT CGGCACGCCA TGCCGTCGCC GCCGCACAGC AACATGAGCC GGAACGGGCG GTGGAACTCG CGCGCCACGC GACCGCGCTC GCGATGGAGA CCGGCTCCGC GCGGGCCCGG CGAGAACTGG CCGCGGTCGC GGCGGCCATG GCCCCGTGGC GCACTCACCC CGTCGGCCAG GATCTGGCGG AGGTGCTCGC GCCCGTTACC ACCGACGAGA CCGGGATGGA TCATGGCTGA
|
Protein sequence | MSRSSLRAEQ DELRARMRAV GMSHDEIAIE FARRYHYRPR AAHRVAHGWT QQQAANHINA HAARTGLDPR GTAPMTAPRL SELENWPLPN NRRRPTPQLL AQLAEVYDTS IHNLIDLDDR EHLTPADTLL INTTRRDARS TPPAGSPVAL SPPAVPRSRP PAGPGFESEY GGAPKRLRLD ASAGNIEGVD ALGRRGFTLL AGSALMAGLA GNGRARRVDP ALVSYFDGQL KGHYHADMLL GSGALIGTVA SQFEVIAQLV DTADGSTRQR MAKVGSSFAA FAAWLWLDAG DPVAAMRCHD AALELAHRCG ERDAVACALV DRAMAFTDLE NAAAVIDLCQ AALVDAQHLS PEVQVFALQQ QAHGASLRGD HRQVDLLLDQ AGRLVDQVDV EEWGTACRRT NGYVEVQRAT CYGRLGLADD ADRLWQQIIP AAHPSARRDV GVWSARHAVA AAQQHEPERA VELARHATAL AMETGSARAR RELAAVAAAM APWRTHPVGQ DLAEVLAPVT TDETGMDHG
|
| |