Gene Ava_4366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4366 
Symbol 
ID3680610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5473418 
End bp5474785 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content48% 
IMG OID637719719 
Productpeptidase M24 
Protein accessionYP_324859 
Protein GI75910563 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCTC TAGCTGATAC TTTGCGCGAT CGCCGTCAAA AATTAGCCCG CCTCATCGAT 
TTTCCTGCCA TTTTATGGTC AGGTGGCAGC AATGCACGCA ATTTTCCTGC CAATCGCTTT
CCGTTTCGGG CTAGCAGTCA CTTCCTCTAT TTTGCTGGCT TACCCCTACC CAACGCCGCA
ATTCACCTAG AAGCGGGTAA GCTGACATTA TTTATCGATG ATCCTGCACC TGGTAGCGCC
CTGTGGCATG GAGAAACGCC CAAACGTGAG GAAATAGCTG AGACTATTGG CGCAGATGCA
GCTAAACCCT TATCAGAATT AACGCCATTC CTAGCAGGTG CAGCCACTAT TGCCGTCCAA
GATGCTACAA CCTGGACGCA GCAGTCCCAA CTACTGAACA GATGGGTATT ACCGCAAAGT
CAACCGGAAG GAATTGATTT AGAATTAACG AAAGCGATCG TCACCTTACG CCTCATCCAC
GACGCAGGCG CATTAGCCGA GTTACGTAAG GCAGCAACTG TGAGTGTAGC AGCCCACAAA
GCCGGTATGG TAGCCACCGC CAACGCCAAA ATAGAAGCGG AAGTTAGAGC TGCAATGGAA
AGCGTTTTCA TCGCTCACAA CATGACGACT GCCTATAACA GTATCGTTAC CGTTCATGGC
GAAGTTCTAC ACAACGAACA ATATCACCAT CCTCTGCAAC CAGGGGATTT ATTACTAGCC
GATGTCGGTG GCGAAACAGA ATTAGGTTGG GCTGCTGATA TTACCCGTAC CTGGCCTGTG
TCTGGTAAGT TTTCCCCCAC ACAACGGGAT ATATATGATG TAGTACTGGC AGCCCATGAT
ACCTGCATTG CCAACATCCT TCCTGGTGTA GAGTATGCAG AGATTCATCT AGTAGGGGCT
AGGGTTATTG CCGAAGGTTT GGTAAATTTA GGCATTTTGC GAGGTAATCC CGAAGATTTA
GTAGAAAAAG ATGCCCATGC TTTATTTTTC CCCCACGGTA TAGGTCATCT CTTGGGTTTA
GATGTCCACG ATATGGAAGA CCTGGGTGAT TTAGCCGGCT ATGAAGAAGG GAAGAAAAGG
AGCGATCGCT TCGGCTTGGG CTACCTGCGT TTAAATCGTC CCCTACGTAC AGGAATGTTA
GTCACAATTG AACCCGGTTT CTACCAAGTT CCCGCCATCT TAAACGATGA AAAAATTCGC
TCAAGATATC AATATACAGT CAACTGGGAA CGCCTCTCTC AGTTTGCCGA TGTCCGAGGA
ATCCGCATTG AAGATGATGT TTTAGTTACA GATACAGGCA GCGAAGTCCT CACCGCCGCC
TTACCAAATG ATGCTGACAC CGTAGAACAT CTAGTTAATA AAATCTGA
 
Protein sequence
MGSLADTLRD RRQKLARLID FPAILWSGGS NARNFPANRF PFRASSHFLY FAGLPLPNAA 
IHLEAGKLTL FIDDPAPGSA LWHGETPKRE EIAETIGADA AKPLSELTPF LAGAATIAVQ
DATTWTQQSQ LLNRWVLPQS QPEGIDLELT KAIVTLRLIH DAGALAELRK AATVSVAAHK
AGMVATANAK IEAEVRAAME SVFIAHNMTT AYNSIVTVHG EVLHNEQYHH PLQPGDLLLA
DVGGETELGW AADITRTWPV SGKFSPTQRD IYDVVLAAHD TCIANILPGV EYAEIHLVGA
RVIAEGLVNL GILRGNPEDL VEKDAHALFF PHGIGHLLGL DVHDMEDLGD LAGYEEGKKR
SDRFGLGYLR LNRPLRTGML VTIEPGFYQV PAILNDEKIR SRYQYTVNWE RLSQFADVRG
IRIEDDVLVT DTGSEVLTAA LPNDADTVEH LVNKI