Gene Ava_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2341 
Symbol 
ID3683456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2904514 
End bp2905794 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content45% 
IMG OID637717686 
Productpeptidase M16-like 
Protein accessionYP_322854 
Protein GI75908558 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCAA CCCTGCGGAA ATTGCCCCGA CTTAATGCCC CAAAACTACA TACACTACCC 
AATGGTTTGA CCATCATAGT GGAGCAAATG CCAGTTGAAG CCGTGAATCT CAGCTTGTGG
ATTGATGTTG GCTCATCTGT AGAATCTGAT GCCATTAACG GTATGGCTCA CTTTTTAGAA
CACATGATTT TTAAAGGAAC TGAGCGCCTT GCCAGTGGTG AGTTTGAACG TCACATAGAA
GAGCGAGGTG CTGTTACTAA CGCCGCTACC AGTCAAGACT ACACTCATTA CTATATAAAT
ACTGCTCCTC AAGATTTTGC CAAATTAGCG CCATTACAAA TAGATGTAGT TTTAAATGCA
AGTATCCCTG ATGAAGCCTT TGAACGTGAG CGCTTTGTCG TGTTGGAAGA AATCAAACGT
TCCGAAGATA ATCCCCGTCG CCGTACCTTC CGCCGGGCAA TGGAAACAGC ATTTGCAGAG
TTACCCTACC GCCGTCCAGT ATTGGGGCCA GAGTCGGTAA TTTCCCAACT AACACCCCAA
CAGATGCGAG ATTTTCACGC TAGTTGGTAT CAACCCCAGT CAATCACGGC TGTAGCTGTA
GGTAATTTAC CGGAAGAACA GTTAATTGAA ACTATTGTCG AAGGATTTAA CCAACTCAAA
AAAACTCCCC CATCCCCACT CCCCACTCCC CGCCCCCTCA ATCTCGAACC TGCATTTACA
GAAATTGTGC GTCGGGAATT TGTAGATGAA AGTCTTCAGC AAGCAAGACT GATCATGGTT
TGGCGAGTTC CTGGGTTGAA CCAACTAGAA CAGACTTATG GCTTAGATGT TTTAGCGGGT
ATTTTGGCAC ATGGAAGAAC ATCAAGGCTA GTGCAGGATT TACGGGAAGA ACGAGGACTT
GTAACTTCGA TTTCTGTCAG CAATATGAGT AATCGTTTGC AAGGGACATT TTATATTTCC
GCTAAATGCG CCGTAGAAGA TTTACAAGCC GTAGAGGAAG CGATCGCTCA ACATATCCGT
AAACTACAAA CAGAGTTAGT CACAGAAAAA GAAATCGCCC GTGTCCGTAA GCGTGTAGCC
AACAGATTTA TTTTTGGCAA CGAAACACCA AGCGATCGCG CTGGATTATA TGGATTCTAT
CAATCACTGG TAGGAGATTT AGAACCAGCA TTTAACTACC CAGCCCACAT TCAAACCCAA
GAAGCACCAG ATTTACTCTT GGCTGCTAAC CAGTATCTTT GCCCAGAGGC TTATGGTGTG
GTTGTCATGA AACCAGCGTA G
 
Protein sequence
MTSTLRKLPR LNAPKLHTLP NGLTIIVEQM PVEAVNLSLW IDVGSSVESD AINGMAHFLE 
HMIFKGTERL ASGEFERHIE ERGAVTNAAT SQDYTHYYIN TAPQDFAKLA PLQIDVVLNA
SIPDEAFERE RFVVLEEIKR SEDNPRRRTF RRAMETAFAE LPYRRPVLGP ESVISQLTPQ
QMRDFHASWY QPQSITAVAV GNLPEEQLIE TIVEGFNQLK KTPPSPLPTP RPLNLEPAFT
EIVRREFVDE SLQQARLIMV WRVPGLNQLE QTYGLDVLAG ILAHGRTSRL VQDLREERGL
VTSISVSNMS NRLQGTFYIS AKCAVEDLQA VEEAIAQHIR KLQTELVTEK EIARVRKRVA
NRFIFGNETP SDRAGLYGFY QSLVGDLEPA FNYPAHIQTQ EAPDLLLAAN QYLCPEAYGV
VVMKPA