Gene Ava_1059 details


Gene Information

Locus tag: Ava_1059
Symbol:
ID: 3678572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1289541
End bp: 1290719
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 48%
IMG OID: 637716395
Product: peptidase M20D, amidohydrolase
Protein accession: YP_321578
Protein GI: 75907282
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
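
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1289541, 1290719)
print(length, protein_length_aa(length))  # 1179 392
```

Under these assumptions, 1290719 - 1289541 + 1 gives 1179 bp, and 1179 / 3 - 1 gives 392 aa, matching the reported Gene Length and Protein Length.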


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000996259
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000027158
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCTCACCC TTATTAAAGA CTTAGCCACA AAACTAGCGC CTCGCCTCAT TGAAATTCGC 
CGTCACATCC ATTCTCATCC CGAACTTAGC GGTCAAGAAT ATCAAACTGC TGCCTTTGTG
GCTGGTGTCT TATCTTCTAG TGGTCTGCGT GTACAAGAAG GTGTTGGTAA AACTGGTGTA
GTTGGTGAAT TACAAGTTAC TGAGAAAAAT CATAGTTTCT TGGCAATTCG CACAGATATG
GATGCCTTGC CCATTCAGGA ACGTACTGGT TTAGAATATG CCTCAAGGGC GGATGGTGTA
ATGCACGCTT GCGGTCATGA TATCCACACT ACGGTAGGTT TAGGGGCGGC AATGGTACTG
TCCCAGATGA CAGAGGAGTT GGGTGGAAAT GTGCGGTTTT TATTCCAACC TGCGGAAGAA
ATTGCTCAAG GTGCAAGCTG GATGGTGGCA GATGGGGCAA TCAAGGATGT TTCCGCTATT
TTAGGTATTC ATGTTTTCCC TTCGATTCCG GCTGGCTCGA TTGGGGTGCG TTACGGGGCT
TTAACGGCGG CGGCGGATGA TTTAGAAATT GTGATTATTG GTGAATCTGG ACATGGTGCG
CGTCCCCATG AGGCGATTGA TGCGATTTGG ATCGCTTCCC AGGTGATTAC TAGTTTGCAA
CAAGCCATTA GCCGCACCCA GAACCCCCTA CGTCCGGTGG TGTTGACCAT TGGGAAAATT
ACTGGTGGGA GAGCGCCGAA TGTGATTGCT GATAAAGTAC AGTTGTTGGG AACAGTGCGA
TCGCTCCACC CAGAAACCCG CGCCCAACTC CCCAACTGGA TTGAAAGAAT TGTTGCCAAT
GTCTGCCATT CCTACGGGGC AAGTTATCAA GTAAATTATC GCCAAGGTGT CCCCGGTGTC
TACAATGACT ATGGTCTAAC CCAGTTGTTT CAATCAGCAG GTGAAGAAGC TTGGACAAGC
GATCGCGTCC AGGTATTACC TGAACCTTCC TTGGGTGCAG AAGATTTTTC TGTATATTTA
GAACACGTTC CTGGTTCTAT GTTTCGCTTA GGTGTAGGCT ACCCTGAGCG AATTATCAAT
CATCCCTTAC ATCACCCTGA ATTTGAAGTC GATGAATCTG CGATCGTCAC AGGTGTGGTA
ACTATGGCTT ACGCTGCTTA CAAATATTTG CGGGGATGA
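
The GC content field in Gene Information can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is supplied as plain text with the same space-separated blocks of ten bases shown here; `gc_percent` is a hypothetical helper name, and how the database rounds its reported value is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; pass the full
# sequence text to compute the value for the whole gene.
first_line = "ATGCTCACCC TTATTAAAGA CTTAGCCACA AAACTAGCGC CTCGCCTCAT TGAAATTCGC"
print(round(gc_percent(first_line), 1))
```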
 
Protein sequence
MLTLIKDLAT KLAPRLIEIR RHIHSHPELS GQEYQTAAFV AGVLSSSGLR VQEGVGKTGV 
VGELQVTEKN HSFLAIRTDM DALPIQERTG LEYASRADGV MHACGHDIHT TVGLGAAMVL
SQMTEELGGN VRFLFQPAEE IAQGASWMVA DGAIKDVSAI LGIHVFPSIP AGSIGVRYGA
LTAAADDLEI VIIGESGHGA RPHEAIDAIW IASQVITSLQ QAISRTQNPL RPVVLTIGKI
TGGRAPNVIA DKVQLLGTVR SLHPETRAQL PNWIERIVAN VCHSYGASYQ VNYRQGVPGV
YNDYGLTQLF QSAGEEAWTS DRVQVLPEPS LGAEDFSVYL EHVPGSMFRL GVGYPERIIN
HPLHHPEFEV DESAIVTGVV TMAYAAYKYL RG
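
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that, under stated assumptions: the amino acid assignments of translation table 11 are taken to be the same as those of the standard genetic code (the tables differ in which start codons are permitted), and alternative start codons are not handled specially, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate_cds("ATGCTCACCC TTATTAAAGA CTTAGCCACA"))  # MLTLIKDLAT
```

Applied to the last nine bases of the gene sequence (CGGGGATGA), the same function returns RG, the final two residues of the protein, with TGA read as the stop codon.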