Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0398 |
Symbol | |
ID | 3682676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 514535 |
End bp | 515389 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637715727 |
Product | histidine triad (HIT) protein |
Protein accession | YP_320919 |
Protein GI | 75906623 |
COG category | [F] Nucleotide transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.315945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGC AAAAAAATCA ATTCAGCCAT CTCACCGCCA TTGAAAGAAC TTATCTATCA TTTCCTGCAC AGTTTTTAAT CAATCAAAAC CTGCTACAAG GACAAATATT AGACTTTGGT TGTGGATTTG GTAATGATGT TAAATTATTG CAGCACAAAG GCTTTGATAT TAGAGGTTAC GACCCTTATT ATTTTCCTCA ATATCCTGAA AATAAATTTG ACACTATAAT TTGCTTGTAT GTTTTAAATG TTTTATTCCC TGAAGACCAA GCTAATATCC TCATGGATAT AGCCTACTTA TTAAAACCAG GAGGTAAAGC GTATTATGTA GTCAGAAGAG ATATTAAAAG GGAAGGATTT CGGGAACATT ATATTCATAA AAAACCTACA TATCAATGTC TTGTTAAACT GCCTTTTCGC TCAATTCATT TAGATGAAAG CCGAGAAATA TATGAATACA CCCACTATAA TAACCAGCGC CATTCATCTA ATTACTGTAT ATTTTGCAAT CCCCATAAAA ACTTAAAATT ATTAACAGAA TCAGCAACCG CCTACGCTAT ATTTGATGGT TATCCGATCA GCAAAGGACA TACATTAGTT ATTCCCAAAC GCCATGTTAG CGACTACTTC GAGCTACCCC AAAAAGAGCA ATCAGCCTGT TGGTTAATGG TGAATAAAGC ACAGGAATTT CTAAAAGCCG AGTTTTCCCC TGATGGCTTT AATATAGGTA TGAACATCAA TCGAGCCGCC GGACAAAATA TTATGCACGC GAGTATCCAC ATCATCCCAC GTTATCAAGG TGATGCGATA GGAGCGAAAA GTGGCATAAG AAACGTTATC CCTCAAAGAA AATAG
|
Protein sequence | MKQQKNQFSH LTAIERTYLS FPAQFLINQN LLQGQILDFG CGFGNDVKLL QHKGFDIRGY DPYYFPQYPE NKFDTIICLY VLNVLFPEDQ ANILMDIAYL LKPGGKAYYV VRRDIKREGF REHYIHKKPT YQCLVKLPFR SIHLDESREI YEYTHYNNQR HSSNYCIFCN PHKNLKLLTE SATAYAIFDG YPISKGHTLV IPKRHVSDYF ELPQKEQSAC WLMVNKAQEF LKAEFSPDGF NIGMNINRAA GQNIMHASIH IIPRYQGDAI GAKSGIRNVI PQRK
|
| |