Gene Ava_2994 details

Gene Information

Locus tag: Ava_2994
Symbol:
ID: 3681229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 3713261
End bp: 3714211
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 42%
IMG OID: 637718340
Product: prolyl aminopeptidase
Protein accession: YP_323499
Protein GI: 75909203
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
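
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon, if counted in the gene length, encodes no residue.
    return codons - 1 if has_stop_codon else codons


print(gene_length(3713261, 3714211))  # 951
print(protein_length(951))            # 316
```

Under these assumptions the 951 bp gene length and the 316 aa protein length agree with the listed coordinates.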


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0267164
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.7335
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGAAC TTTACCCACT CATCGAACCT TATAAAGAAG GTAAATTAAA GGTTTCCCAA 
TTACACACCA TTCATTTTGA AGAATCAGGA AACCCCCAAG GTAAACCCAT AGTGTTATTG
CATGGTGGCC CTGGTGGGGG CTGTCCTCCA GTTTATCGGC AATATTTTCA CCATGAAAAA
TGGCGATTAG TCATGTTTGA TCAACGTGGC TGCGGTAAAA GTCAACCCCA TGCCGAATTG
AGGGAAAATA CCACTTGGGA TTTAGTCAGT GATATTGAAA AACTCCGAGA ACATTTAGGA
ATAGAAAAGT GGGTGGTTTT TGGTGGGAGT TGGGGCAGTA CTTTATCTTT AGCTTACAGT
CAAACTCACC CTGAGCGTTG TTTAGGCTTG ATTTTACGCG GGATATTTTT GCTCAGACAA
AAAGAGTTAC GCTGGTTTTA TCAAGAAGGT GCTAGTTATA TTTTTCCTGA TGCTTGGGAG
GAATATCTGC AACCAATTCC TGTAGATGAA CGTGATGATT TACTCACGGC TTATTACCAA
CGTTTAACTA GTCCAGATTC ACAAGTTAGA CAAGAAGCGG CTCGTGCTTG GTCAATTTGG
GAAGCTAGCA CTAGCAGATT ATTTCCTGAT ACCCAACTAA AGCAAACTTT TGCTGAGGAT
AAATTTGCAG AAGCTTTTGC CCGGATTGAA TGCCATTATT TTATAAATAA AGGCTTTTTA
AATTCTGACC ATCAACTATT ATTAAATGTT GACTGCATTC GCCATATCCC TAGTGTAATT
GTCCAGGGGC GTTATGATGT AGTTTGCCCA ATGACATCAG CTTGGGAATT ACATCGTGCT
TGGCCGGAAG CTGAATTTAT TGTAGTTCCT GATGCTGGTC ATTCTATGAG TGAAGTGGGG
ATTCGTAGTG CTTTGATTGA GGCGACGGAT AGGTTTGCTG ATGCAGGCTA G
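
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a minimal sketch (the helper names are hypothetical, and the record does not say how its GC content figure was rounded), GC content could be computed like this:

```python
def clean_sequence(raw: str) -> str:
    """Strip the block spacing and line breaks used in the listing."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as listed; paste the full listing to
# compute the whole-gene value.
first_line = "ATGCGCGAAC TTTACCCACT CATCGAACCT TATAAAGAAG GTAAATTAAA GGTTTCCCAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_content(seq), 1))
```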
 
Protein sequence
MRELYPLIEP YKEGKLKVSQ LHTIHFEESG NPQGKPIVLL HGGPGGGCPP VYRQYFHHEK 
WRLVMFDQRG CGKSQPHAEL RENTTWDLVS DIEKLREHLG IEKWVVFGGS WGSTLSLAYS
QTHPERCLGL ILRGIFLLRQ KELRWFYQEG ASYIFPDAWE EYLQPIPVDE RDDLLTAYYQ
RLTSPDSQVR QEAARAWSIW EASTSRLFPD TQLKQTFAED KFAEAFARIE CHYFINKGFL
NSDHQLLLNV DCIRHIPSVI VQGRYDVVCP MTSAWELHRA WPEAEFIVVP DAGHSMSEVG
IRSALIEATD RFADAG
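
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence as listed.
print(translate("ATGCGCGAAC TTTACCCACT CATCGAACCT"))  # MRELYPLIEP
```

The result matches the first ten residues of the listed protein sequence; passing the full gene sequence to `translate` follows the same pattern.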