Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4133 |
Symbol | |
ID | 3681209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5149845 |
End bp | 5151065 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637719479 |
Product | hypothetical protein |
Protein accession | YP_324627 |
Protein GI | 75910331 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00393583 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00385756 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATCTGCA AACAGTACAA GAAAATACAT ACAAGCATAT CTCATCAAAC CACTCAAATT GCTCTGCATA CGGCAAAACC ACTCTTAGCT TTAGCCACAT TGATAGTAGC TGATGTACCA GCCCAAGCCA TTACATTTAA CTTTACCTAT CAGCCTGGAA TCACACAGGA ACAGATTGCA GCAGTTGAGT TAGCAGGAAA TATTTGGTCT GCTTACTTAC AAGATATAGA TGTTGTGGTC AACATCCATG TCGAGATGAC TGAAGGGGTT TTAGCTGAGG GTAAATTGGG CGGTACAACC CCAGCAATTA AAAAAATCAA CTATGACAAG TTCAAAGAGG GTTTGGGTGC AGATGGTACG GCTAATATCT ATCAGTTGCC AACATCTATC TACAGCACTG ATAAATATAG AACTAGACTA GCAGGCGGCA TCATCAATAA CAGTAACTAT GAACTGCTGA CAACTACAGC CAATAATAAA GCCCTCGGTA ATGACTTGAG TGGAGATGCT TCTGAATTAG ATGCCTATAT TCAACTGGAA AAATCGACTA ATTGGAGTTA TAGGTACGCT GGGGGCAAAA TTGAACAGAA CCAATATGAT TTTGTTAGTG TAGCTGTCCA CGAAATTGGG CATAGCTTGG GATTTATTAG TGGTCTTGAT GCTTTGAGTG GTTTAGCTTT ACCTACGGCT GTTGATATGT TTCGCTACTC AACTGAGAGT ACCAAACAAA GAGCAATTGA TTACACAGTG GGTGGAACTA AATACTTCTC AATCAACGGA GGACAAAATC CATTTAACTT TACTCAAATG GAAGGAAGCA CACCAACTGT ATACCAAGCA ATATTTTCCA GTGGCGAAAA CACTCTCTTG GGAGGCGATG GGGAGCAGGC TAGTCACTGG AAAATAGACA GCCAAACCTA TTTAGGGATT ATGTCCTCAA CCATCAGCAT GGGAGGAATT AAAAAAATTT CCAGACTCGA TCTCACCGTC CTCGATTACA TTGGTTGGCA GGTTGATTAT TTCCCAATCA TCAATTTGTC CGTCCTATCC ACAAATGCCC AAACCAAAGC ACAAAAAATT TGGGATTCAC AATTCGATAG TAATACAAAT AACGATGCCA TCCGCGATCG CTCTTCTGAT GTGCAGCAAA TGATCCAAAA AAGTGGCATT TATAACTGGG GTTGGAGTGG TTATTGGCAA ACAGCTCATC CTGCACCATA G
|
Protein sequence | MICKQYKKIH TSISHQTTQI ALHTAKPLLA LATLIVADVP AQAITFNFTY QPGITQEQIA AVELAGNIWS AYLQDIDVVV NIHVEMTEGV LAEGKLGGTT PAIKKINYDK FKEGLGADGT ANIYQLPTSI YSTDKYRTRL AGGIINNSNY ELLTTTANNK ALGNDLSGDA SELDAYIQLE KSTNWSYRYA GGKIEQNQYD FVSVAVHEIG HSLGFISGLD ALSGLALPTA VDMFRYSTES TKQRAIDYTV GGTKYFSING GQNPFNFTQM EGSTPTVYQA IFSSGENTLL GGDGEQASHW KIDSQTYLGI MSSTISMGGI KKISRLDLTV LDYIGWQVDY FPIINLSVLS TNAQTKAQKI WDSQFDSNTN NDAIRDRSSD VQQMIQKSGI YNWGWSGYWQ TAHPAP
|
| |