Gene Ava_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4094 
Symbol 
ID3681559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5086055 
End bp5087233 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content46% 
IMG OID637719442 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_324590 
Protein GI75910294 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.665516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.883146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCATG TCCTGACATC TGAAAAAGTC TTCCCTCGAA TGGTAGAGTT ACGCAGATAT 
TTACACAGCC ATCCTGAACT GGCATTTGAG GAAAAAAAGA CAGCATCCAT CGTTATAGAT
GAACTCAAGC GTTTAGGTAT TCCTTTTTGG TATGGTGGGG TGGGTAGTGG CATCATTGGC
AAGCTGATCA ATGCTGGACA AAGAGCGCCG ACAATTGCTT TGCGGGCTGA TATGGATGCT
CTCCCCGGAC AAGAAAATAC AGGATTGCCC TTTGCTTCTT TGCACCCAGG TAAGATGCAC
GCCTGCGGTC ACGACGGTCA CATGGCGATG GTGTTGGGTG CAGCAGCTTT ATTGAAAGAG
AATCCGCCAC CAGGAAACGT GGTTTTCATC TTTCAACCTG CGGAAGAAAA GGGTGCTGGC
GCGAAAGTGA TGATCCAATC TGGTGCTTTA GAAGGAGTAA ATGCCATCTT TGGCGGTCAT
GTTACCCGTC ACTACCAAGT CGGAGAGATT ATGGTTGCCA AGGGAGTAAT TACTGCCCAA
TCTGATGGCT TTACTATTCG CGTCAAAGGA CGAGGCGGAC ATGGAGCTAG ACCCCATGAA
GCTGTTGATG CTGTAGTTGT TGCCGGACTG CTAATTATGG CTGTGCAAAC CTTAGTTTCA
CGAGAAATCA ACCCCGCCTA TCCTTCAGTA GTTACCATCG GCAAAGTAGA AGCAGGTAGT
GCAGGTAATG TCATCGCGGA GGAAGCAATT CTAGAAGGTA CTATTCGCAC AACAAATTTA
GATGTGCAGA ACCACATAAT TGACGGACTC AAGCGCATAG CCACAGCCGT GGGAGAACTG
CACAATGCCA GAGTAGAAAT AGAAATTCGC CACGGATATC CACCTGTCAT TAATACAGGA
AAAGAAACAG AAATTGCTCG ACGGGCTATA GTTGATATTT TGGGTTCCAA GGGATTAGTC
ACAATGGATT ATCCCAGCAT GGGAGCAGAA GATTTTTCTT TCTACTTGTT ACACGTTCCT
GGGTGTTACG TTAGATTTGG TGCTTGTCAA CAAGGTTGTG AGAACATTCC TTTGCATAGT
CCTTCCTTTG ACTTTGACGA AGAAGCATTG AAAGTTGGCG CAGCTTTTTT TGATCGGGTG
GTTCGAGAAG CGATCGCAGA ATATGCAGAT AATTCCTAG
 
Protein sequence
MAHVLTSEKV FPRMVELRRY LHSHPELAFE EKKTASIVID ELKRLGIPFW YGGVGSGIIG 
KLINAGQRAP TIALRADMDA LPGQENTGLP FASLHPGKMH ACGHDGHMAM VLGAAALLKE
NPPPGNVVFI FQPAEEKGAG AKVMIQSGAL EGVNAIFGGH VTRHYQVGEI MVAKGVITAQ
SDGFTIRVKG RGGHGARPHE AVDAVVVAGL LIMAVQTLVS REINPAYPSV VTIGKVEAGS
AGNVIAEEAI LEGTIRTTNL DVQNHIIDGL KRIATAVGEL HNARVEIEIR HGYPPVINTG
KETEIARRAI VDILGSKGLV TMDYPSMGAE DFSFYLLHVP GCYVRFGACQ QGCENIPLHS
PSFDFDEEAL KVGAAFFDRV VREAIAEYAD NS