Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6957 |
Symbol | |
ID | 5675270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8479090 |
End bp | 8481048 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245806 |
Product | hypothetical protein |
Protein accession | YP_001511197 |
Protein GI | 158318689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.304536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGG AACGAGAACC GGAAAGTCCG GTGGGAGTTC ATGATCTGAT ACGTCATCAC TGGATTTCGA GTAGGGATTT CGACGCGCTC GCGTCCGGAA CCGATCAACA CGGGCTGATC TGCCAGCTTC GGGTTGCCGA ACGTAGTTAC CGATATCTCT CGCTCCGTAG TATCCTTGAT TTCGCCCGCG GGCATGAGCC GACAACCGGG CTGCTGTCGT CTCCTGACAC GGCATGGGAC CTTCTCGTGG AGGCCGAGCG CGTCGCTCCG GCCATGGTCA CAGCCATCCT GGACCTACCG AGCGTCGGGG CCTGGGTGGC CCGGGCGCTG CGGCGGACCC GTGGGCTTCT GTACGACGAG ATTCCGCTGT GGGTCGACCT CGGCTATCTC CATCTGCTCT CGGCGGCCGC CGGTATCCGA TGCGGCATCC CTTTTAGGCT GGATGTTCCG CTACGCCACG GTCAGCTTCA CCTTCCGACG CTCGGCTCCA TCGTGCTGCC GGGCAAGGAG ATCTGGGGTT CGACGACCGT CATCTCCGAC GGGCGATCGG CGCATGCCCT CCTGCCTGTG GGCAAGATCC GTCTCGCGGG GCGCGACCCG GTGGACGATC CACCCGGCTG GAGGAGAACG ACGAGCCTGG ATGCGCGGCA CCGCGGGGCG GCGGCCACCG TCTACCTTGA CAGCAGCGAC CCCTATCGCA TGGTCGAAAC GCCCGCGCTG CCCGAGAACA TCGGCATACC CGTCCAACGG CACTGGGAGT CCCTGTTCCA GGAAGCCTGG GCGGAACTGG TGGAACAGGA CAGCGAGGTT GCCCGATGCG TGGCCGAGTG CACCCTCACG CTGGTCCCGC TGCCCCGGGC AGAGCGGTTC AGGGAAAGAA GCGCATCGTT CGGCGACTCG TTCGGCGGGG TTATTCTATC GCTGCCGGAT AGCCCGGAGC GGTTCGCTGT GACCCTGGTA CACGAAATGC AGCACGCAAA ACTCGGCATC CTACTCCATC TATTCTCGTT CTTGCGCGGG GAAGGAAGCA TGCTTGCCTA TGCGCCGTGG CGTGACGATC CCCGACCCCT GCAGGGCCTG CTGCAGGGGA TCTACGCCTT TTTCGGCGTC GCCGGATTCT GGCGTCGTCG CTTCGCCATG GCGAAGGGAG AGGAAGCGGC TCTCGCCGGC TTCGAGTTCG CCCTGTGGCG CGGCAAGGTG AGCGAGGCCA CCGCACACGC CAGGAGTCGT CAGGAGTTCA CGGCCCTCGG CCATCGCTTT CTGACCGGGA TAGCCACCAC AGTCACGGCG TGGTCGGCCG AACCGGTGTC GCCGTACTAC GCAGGGCTGG CGAATCTCGC CGCGGCGGAC CATCGTGCGG GCTGGCGCGT CCACCACCTC ATCCCTCCGG CCGCCGACGT CGCGAGCCTG TCCCGGGCCT GGATCGCCGG GACGGATCCG CGTGAGCTTC GTGTCCCCGG CCCGTCCGCG CTCGTCCCGG ATGGGAAGGT GCCCGACCTG GACACCCTGG TCGTCCTGGT CCGGTACTGG TTGGCCGACC GGGAGCTCTT CCGCCAGATC GAACGGAATG GCCAGGTCGG CAGCGTGGTC ACCGGCGCGA CGGCGGCGGA TCTCCACCTG GTCGCCGGAC GCCACGACGA GGCCGCCCGT GCCTATCTCG ACGAACTCGG GGAGCCCTCG CCCGGACTGA CTGCCTGGAC GAGGCTCGGA TCAGCGCTCG CCACGAGCCC CGACCACCGG ATGGCGGCGG CCGGCCGTGC CCTGCTCACC CGGCCCGAGC TCGTCCGGGC GGTGGCCCGT GCCGTCGAGG CGGACACCGG CCGCAGACCA GCGCCGGTCG ACCTCGCCGG ATGGCTGGGT GAGCTTCCCC CGAACGAGGA CCGGGTCCGC GGCACGGGGG CGGACGACGT GTCCGCCCCC GCCCCGAGGC CCGGCGACTC GCAGGAAGCA GTGCACTGA
|
Protein sequence | MRQEREPESP VGVHDLIRHH WISSRDFDAL ASGTDQHGLI CQLRVAERSY RYLSLRSILD FARGHEPTTG LLSSPDTAWD LLVEAERVAP AMVTAILDLP SVGAWVARAL RRTRGLLYDE IPLWVDLGYL HLLSAAAGIR CGIPFRLDVP LRHGQLHLPT LGSIVLPGKE IWGSTTVISD GRSAHALLPV GKIRLAGRDP VDDPPGWRRT TSLDARHRGA AATVYLDSSD PYRMVETPAL PENIGIPVQR HWESLFQEAW AELVEQDSEV ARCVAECTLT LVPLPRAERF RERSASFGDS FGGVILSLPD SPERFAVTLV HEMQHAKLGI LLHLFSFLRG EGSMLAYAPW RDDPRPLQGL LQGIYAFFGV AGFWRRRFAM AKGEEAALAG FEFALWRGKV SEATAHARSR QEFTALGHRF LTGIATTVTA WSAEPVSPYY AGLANLAAAD HRAGWRVHHL IPPAADVASL SRAWIAGTDP RELRVPGPSA LVPDGKVPDL DTLVVLVRYW LADRELFRQI ERNGQVGSVV TGATAADLHL VAGRHDEAAR AYLDELGEPS PGLTAWTRLG SALATSPDHR MAAAGRALLT RPELVRAVAR AVEADTGRRP APVDLAGWLG ELPPNEDRVR GTGADDVSAP APRPGDSQEA VH
|
| |