Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0399 |
Symbol | |
ID | 5668823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 475659 |
End bp | 478193 |
Gene Length | 2535 bp |
Protein Length | 844 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641239332 |
Product | hypothetical protein |
Protein accession | YP_001504771 |
Protein GI | 158312263 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCATC TGGCATCACC CAATGGCGAC CTGACACCTG GCATGCGTGA AGCACTAAGC CGCATGCGCG AAATAAGAAG AAGGTTCGGC CCGGCATATG CGGCGATGGA GAAAAGGTCG GGCGTAAGTC ATTCCAACTG GCACCGCTGG TTGCACGGAA AAGGGGTCAT GCCTTTGGAG AGGGTGAGCG AGGCGAAAGG CATGCTGATG GGTTGCCTAG AAAGGGAGAT GAAGAAGCAT TCCGGTGACG AGGATGTGCT ACGGACGCTA CGCGAGCTGA AAGAGTCGTT GGAGGAGGTG ACAGCGCTGT GGATCCGAGA CCAAGGATCC ACTACCGCGC AAATGGTGGC CAATGAAATG GTGACTAATG AGCAGCCCGA GCCCGATGTC CAGATCGAGA ACATATTGCC GGAACATGGT CAGATGTTTT CGAGCGACTG GATGACTGAC ATGGCCGAGG AGCTCGCGGC GGCAGTCGAG AGACAGTGGA TCTGGGAGGC CGACAGACGC GGATTGATCC GCCCTCCGGC GATCCCTGTA CGATGGCGCT GGGCGCAGGG CATGACCAGC AAACTTGACG TGGTCCTCAA TAACTCACAC CGGCGCACCC TGTTCCCTCC CCTACCGGGA GTGGCGTCGG CGACATCTGC GAGCTTGCAG TCCGGCGGCC TCCAGCAGCT TTTCGATGCA TATGCCGGGG TGGACTCCGG CCGCCTTGTC ATCATGGGCA AGTATGGCAG CGGAAAGTCT GCAACCGCCA TCTTGATGCT GCTCGACGCG TTGTCACACC GTGACAGCCT CGACGACGTG CAACGGGCGA AGGTACCCGT GCCGATATTT CTGACCACAC ACGACTGGGA CCCCCGACAC CAGAATCTTC CCGAATGGCT GATCACTCGA CTTGCTAAGG AATACCAATT TCTCCGATCA ACTCAGGACG GCCGGAGTGC TGCCGCAAGA TTGGTCGAAG ATGGTCGGGT TGCACTGTTC CTTGATGGCT TTGACGAAAT TCCATCCGAG CTTCATCGGG ACGCGTTGGA GGAGATCCGC CGACTGGCGA CGTTCCGGCT GGTCACACTG ACCCGCGACG GGGAGTTTGC CGCAGCCGTG CGTACGGGAC ACCATGTGGA CGCCGCCGCG GTAGTGGAAT TGTTGCCGAT CCCGGCCGAT GAGATCGCTT CTTACCTTGA GACCCGTCAG ACCGATCCCA TGCCCCCGCA GTGGGAGAAG CTTGTCACCT TTCTGCGAGA AAATCCGGAC CATATCCTGG CGCAAGCGCT AGACTCGCCG CTCGTCTTGA CGCAACTGCG AGACGCAATC CAAGATCCGG CTGACATCGA CGATCTCCTC GTCGCAGACA GGTTCGAGAG CCGAGAAGCG GTCGCAGAAT ATCTGATGGA CCGGTCCATC GATGTCGCCT ATCGGCGATT CTCCCGCGAT ACGTCCACCG TCGGTCCCGA CAAGGCGAGA GCTGCTCTAG GATATATAGC CGCGAAGATG AACGAGGAAA ACACCCGCGA CCTGGCCTGG TGGCAAATTC ATCACTGGGC CTCGCCTATT CCTCGGATCC TTGCGACGAC AGTTCTAGGC GTGCTGATAG GGGCACCTGT AGGCGCGTTG ATGTTCGGTC CGCTCGGCCA GTATGCGGTG AGAGGGCATA CTGGAACGTT GTTTGGGGCC CAGTATCTTT CCATGATGTG CCTTGTTTTT GGGCTGATGG CTGGACTCGT TTCGGAGGCC CGCGGAGGCC GCTCCCGTCG AACAGGCCGA TTCAGATGGG TCGACTGGTA TCGCGGTCAG ACTAACAGCG CGGTCGGTCT GCTGTTTACT GTTGCCGTCA CGATGGCCGT TGGTAACCAG TCCAATTACG CCTTCGGGGC ATTGGCGGGG GTTCTAGCAG GGATCGTGGC CGGGTATGCT GCGAGGGGCG ACCACCAAGA TCACAGGTGG ATACAGCGGT CTTGGTGGAT CACGCTTCGA TCAAGGCTCG ACCCGGTCGC AGGGGCTGTA GCGGGATTGC CGATCGGACT GACGTATGGA TTGACCAAAG AACATACTCA GGGCCTCGTG GCCGGTATCA TGAGCGCAAT CGCCTTCGGC CTCATGGTCG GCTTCGCGCG ACCGACGGCT GGTATTCAGG CTGTTACCGA TCCACGAACA TCCTGGCTCC GAAATCACGA ACATGCAGCC ACCTTCAGCC TGGCCGCTGG CCTAGCACTC GGGCTTCCGC TTGGATTGAA AAACGGGCTG GAGCACGGCG TCATCGCCGG CGCCGTCGCC GGCGTTTGCG TTGGGCTCAT CGTCGGACTT GGGTGTTTGA TCGGGGCGTC CGACAGGTTG CGGACCACCC TGCTGTTTCT CCAGCTGCGC GGCCACGGCA TTCCTTTGGA CGGAATGCGT TTCCTGGAGG ATGCGCGCCG GAAGAATCTT CTCCGTACCG TCGGACCGCT ATACCAGTTC CGGCATCCCA GCATTCAAGA CCGACTCGCG AGGACATACG GGCAGCAGCA AACCCGCGTC GATCCCGAAA TCTGA
|
Protein sequence | MPHLASPNGD LTPGMREALS RMREIRRRFG PAYAAMEKRS GVSHSNWHRW LHGKGVMPLE RVSEAKGMLM GCLEREMKKH SGDEDVLRTL RELKESLEEV TALWIRDQGS TTAQMVANEM VTNEQPEPDV QIENILPEHG QMFSSDWMTD MAEELAAAVE RQWIWEADRR GLIRPPAIPV RWRWAQGMTS KLDVVLNNSH RRTLFPPLPG VASATSASLQ SGGLQQLFDA YAGVDSGRLV IMGKYGSGKS ATAILMLLDA LSHRDSLDDV QRAKVPVPIF LTTHDWDPRH QNLPEWLITR LAKEYQFLRS TQDGRSAAAR LVEDGRVALF LDGFDEIPSE LHRDALEEIR RLATFRLVTL TRDGEFAAAV RTGHHVDAAA VVELLPIPAD EIASYLETRQ TDPMPPQWEK LVTFLRENPD HILAQALDSP LVLTQLRDAI QDPADIDDLL VADRFESREA VAEYLMDRSI DVAYRRFSRD TSTVGPDKAR AALGYIAAKM NEENTRDLAW WQIHHWASPI PRILATTVLG VLIGAPVGAL MFGPLGQYAV RGHTGTLFGA QYLSMMCLVF GLMAGLVSEA RGGRSRRTGR FRWVDWYRGQ TNSAVGLLFT VAVTMAVGNQ SNYAFGALAG VLAGIVAGYA ARGDHQDHRW IQRSWWITLR SRLDPVAGAV AGLPIGLTYG LTKEHTQGLV AGIMSAIAFG LMVGFARPTA GIQAVTDPRT SWLRNHEHAA TFSLAAGLAL GLPLGLKNGL EHGVIAGAVA GVCVGLIVGL GCLIGASDRL RTTLLFLQLR GHGIPLDGMR FLEDARRKNL LRTVGPLYQF RHPSIQDRLA RTYGQQQTRV DPEI
|
| |