Gene Information |
Locus tag | Ava_1974 |
Symbol | |
ID | 3681537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 2448511 |
End bp | 2449785 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637717315 |
Product | hypothetical protein |
Protein accession | YP_322491 |
Protein GI | 75908195 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
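The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the helper names `span_length` and `expected_protein_length` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a single stop codon.

```python
# Values copied from the Gene Information table.
START_BP = 2448511
END_BP = 2449785
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (1275 bp) matches the span between the start and end coordinates, and the reported protein length (424 aa) matches 1275 / 3 codons minus the stop codon.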
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00936884 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.989231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA AAACGTGGCT GCAAATACCT GCTTTTAGGT CAAGAAATTA TCGCTTGTTT TTTGCAGGAC AAGGAATTTC TCTAATTGGC ACTTGGATGA CACAACTAGC CACTGTTTGG CTAGTTTATA GCTTAACTAA TTCCCCTTTA ATGTTGGGAG TTGTGGGATT TACTAGTCAG ATTCCTAGTT TCTTTCTTGC ACCTTTTGGA GGCGTATTTG TAGACAGATT TTCCCGTTAT CGAACCTTGA TTGGTACACA AGTACTAGCA ATGTTTCAGT CGTTAACCTT AGCAGCGTTA ATGTTCACAG GTGTCATTCA AATTTGGCAT ATTATCGCCT TAAGCTTACT CCAAGGAATG ATTAACGCCT TAGACGCACC TGCTAGACAA GCCTTTGTCC CAGAACTCGT GCAACGCAGA GAAGATATAG CTAATGCGAT CGCCATCAAC TCGACTATGA TTAATGGGGC GCGATTAATT GGCCCAGCCA TTGGGGGCTT ATTAATATCC TGGGTGGGTG TAAAGTATTG TTTTCTAATA GATGGCTTGA GTTATATTGC CGTCATTGCT AGTTTATTGG CGATGAAAGT TAAACCTTGG ACAGTGACTA GAATTGATGG TAATCCCTTA CAGCAAGTCA AAGAAGGATT TATTTACGCC TTTAGCTTTC CACCAATTAG AGCGATTTTA TTACTATCAA CTTTAGTGAG TTTGATGGGA TTACAAAATA CTATCCTTGT GCCAATTTTT GCTGAAACTA TTCTCAAAGG TGGTGCGGAA AGCTTAGGAT TTATCATGGC AGCTTCAGGA CTGGGAGCCT TATCTGGTGG TATTTATTTA GCCAGTAAAA AAACAATTCT AGGCATTGGT AAACTCATTG CTATAGCTCC AGCAATTTTA GGATTTGGAC TAATTGCTTT TGCCATTTCT CGATATTTAC CTCTTTCTCT ATTCACCATG TTGTTTGTTG GCTTAGGAAC AATTTTACAA ATAGCTGCTA GCAATACATT TCTGCAAACA ATTGTCGAAG AAGATAAGCG TGGTAGATTG ATGAGCTTAT ATACCATGTC ATTTTTAGGG ATGATACCTG TGGGTAATTT ATTGGGTGGT GCATTAGCCA ATAGAATCGG CGCACCTAAT ACATTAATTA TTGATGGTAT AGCTTGTATT ATCGGTTCAA TATTATTTCA AAGAGAATTA CCAAAGCTGA GAAAATTAAT CATGCCAATT TATGAGCAAA AAGGTATTGT AACAGTTGAG AATAAAAGTG CTTAA
|
Protein sequence | MNKKTWLQIP AFRSRNYRLF FAGQGISLIG TWMTQLATVW LVYSLTNSPL MLGVVGFTSQ IPSFFLAPFG GVFVDRFSRY RTLIGTQVLA MFQSLTLAAL MFTGVIQIWH IIALSLLQGM INALDAPARQ AFVPELVQRR EDIANAIAIN STMINGARLI GPAIGGLLIS WVGVKYCFLI DGLSYIAVIA SLLAMKVKPW TVTRIDGNPL QQVKEGFIYA FSFPPIRAIL LLSTLVSLMG LQNTILVPIF AETILKGGAE SLGFIMAASG LGALSGGIYL ASKKTILGIG KLIAIAPAIL GFGLIAFAIS RYLPLSLFTM LFVGLGTILQ IAASNTFLQT IVEEDKRGRL MSLYTMSFLG MIPVGNLLGG ALANRIGAPN TLIIDGIACI IGSILFQREL PKLRKLIMPI YEQKGIVTVE NKSA