Gene Information |
Locus tag | Ava_0947 |
Symbol | |
ID | 3681167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1141853 |
End bp | 1143400 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637716281 |
Product | hypothetical protein |
Protein accession | YP_321466 |
Protein GI | 75907170 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
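
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(1141853, 1143400)
print(length_bp)                  # 1548
print(protein_length(length_bp))  # 515
```

Both results match the Gene Length (1548 bp) and Protein Length (515 aa) fields.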
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCAATA GAAACAAACA CATTTCCCAA GTTGTAGTTA CTGCTGCACA AATGCGAGAT ATAGAAGCGC GTATATTTGC AGCCGGAATG CCTGTATCTG CTTTGATGGA AAAAGTAGCG GGATTAATTG CCAAACGCAT TCAAGAGATT TGGGAACCAT CTAATCATGT AGGAATCCTC ACAGGGCCAG GACATAACGG CGGTGATGGG TTGGTTGTGG CAAGGGAACT GCATTTTAGG GGTTATGATG TTTGGATTTA TTTGCCTTTT GATAAGCTGA AGGAGTTAAC ATCACAACAT TTACAATACG CTCAGAGTTT AGGTATACCT TGTTATCAAG AAATTGAGCA ATTACCAGAT TGTGATTTTT TGGTTGATGG GTTGTTTGGG TTTGGTTTAG AAAGAGAAAT TACTGATCCC ATCGCTTCAG TAATTAATCA GTTGAATGAA TGGAACAAGC CAATTATCAG TATTGATTTA CCTTCGGGTT TGCACACTGA TACAGGCGAG GTTTTAGGGA CGGCGATTCG CGCCAACTAT ACCTTGTGCT TAGGTTTATG GAAACAAGGT TTGTTGCAGG ATCAGGCTTT AGATTATATT GGCAAAGCTG AGTTAATTGA TTTTGATATT CCCTGGGCTG ATGTGCAAGC TGTATTGGGT GATGTACCCA AAGTCAAACG CATTACACCA ACAACAGCCT TATCTACTTT GCCTTTACCT CGTCCACCAG TGACGCACAA GTATAAAGAA GGGCATTTAC TATTGATTTG TGGTTCTCGC CGTTATGCAG GTGGGGCAAT TTTGACGGCT TTGGGTGCTA GGGGTAGCGG TGTCGGGATG TTATCAATTG CTGTACCCGA ATCTCTCAAG CAGCTGTTGG TATCACATTT GCCAGAAGCT TTAGTGATTG GTTGTCCAGA GACGGAAACT GGAGCCATCG CTCAACTACA ATTACCAGAG AATACTAACT TAAGTTCCTT TAGTGCGATC GCCTGCGGCC CTGGTTTAAC GAAAGATGCT ATATCTATTG TGCAGGAAGT ATTAGCAAGC GATCGCCCTT TAGTTCTCGA TGCTGATGGT TTAAATATTT TGGCACAGTT GGGAACAATC CTCACATTAC AACAGCGCCA AGCTGCAACA GTACTCACAC CCCATACAGG CGAATTTCAC CGATTGTTTC CTGATATTGC TGATGCTAAA GGCGATAGAG TCAAAGCCGT GCAGGAAGCA GCAGCCCAAA GCGGTGCAGT GGTATTGTTA AAAGGAGCGA GAACTGCCAT AGCCAATCCT CAAGGTTCAG TATGGATTAA TCCTGAAAGT ACACCAGCTT TAGCTCGTGG TGGTAGTGGC GATGTGTTAA CGGGGTTATT GGGTGGATTG TTGGCACAAG CGGTAAATAA AGCAATACCT GTAGAGGATA TTGTGGCTAC GGCTGCATGG TGGCACGGAC AAGCAGGTAT TTTAGCCGCC AGTGAGCGTA CAGAGTTAGG CGTGGACGCA TTTACGTTGT CACAGTATTT GTTGAAAGTG ATTGTTGGTC GCTCTTAG
Protein sequence | MLNRNKHISQ VVVTAAQMRD IEARIFAAGM PVSALMEKVA GLIAKRIQEI WEPSNHVGIL TGPGHNGGDG LVVARELHFR GYDVWIYLPF DKLKELTSQH LQYAQSLGIP CYQEIEQLPD CDFLVDGLFG FGLEREITDP IASVINQLNE WNKPIISIDL PSGLHTDTGE VLGTAIRANY TLCLGLWKQG LLQDQALDYI GKAELIDFDI PWADVQAVLG DVPKVKRITP TTALSTLPLP RPPVTHKYKE GHLLLICGSR RYAGGAILTA LGARGSGVGM LSIAVPESLK QLLVSHLPEA LVIGCPETET GAIAQLQLPE NTNLSSFSAI ACGPGLTKDA ISIVQEVLAS DRPLVLDADG LNILAQLGTI LTLQQRQAAT VLTPHTGEFH RLFPDIADAK GDRVKAVQEA AAQSGAVVLL KGARTAIANP QGSVWINPES TPALARGGSG DVLTGLLGGL LAQAVNKAIP VEDIVATAAW WHGQAGILAA SERTELGVDA FTLSQYLLKV IVGRS
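
The sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how the GC content and the translation (table 11) could be recomputed from the gene sequence, the snippet below strips the block spacing, counts G and C bases, and translates codon by codon. The helper names clean, gc_content and translate are hypothetical. The codon-to-residue string is the standard assignment, which translation table 11 shares for internal codons; table 11 additionally allows alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record).

```python
import itertools

# Codon table built in TCAG order; residue assignments are those of the
# standard code, which table 11 shares for all non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGCTCAATA GAAACAAACA CATTTCCCAA"
print(translate(prefix))  # MLNRNKHISQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to gc_content and translate would allow the same comparison against the GC content field and the complete protein sequence.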