Gene Ava_2411 details


Gene Information

Locus tag: Ava_2411
Symbol:
ID: 3683093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2999314
End bp: 3000519
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 45%
IMG OID: 637717756
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_322923
Protein GI: 75908627
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
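
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene information above.
start_bp = 2999314
end_bp = 3000519

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the reported gene length (1206 bp) and protein length (401 aa).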


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000241562
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000145022
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAATTTAT CCCTAAAGCA ACTGGCCGTT TATCTGTCTC TACTAGTAGT TGGTGGTAGT 
GCAGGTTTGT TAGGCAGTCG CTATCTCCTC CCACAAAATC GCTCGTTCCA ACAACTCAAA
AATGTCACAG TCGGTTTGCC TTCGGAATCT GTAGCGTCTA ATCCTGTCAT AGGTTCTGCG
GCAAATAATG GGGGGGATAA TGTCAATTTT ATTGCTAGTG CTGTGCAGAA AGTTGGCCCG
GCTGTAGTGC GAATTAATGC CACCCGTAAA GTTGCCAATC CTATCTCTGA TGTTTTAAAG
AATCCTCTAT TACGTCGATT TTTCGGTGAA GATGAACAGC CAATTCCGCA AGAACGAATT
GAGCGGGGTA CAGGTTCGGG GTTTATTTTG AGTGAAGATG GGCAACTACT AACTAATGCC
CATGTCGTAG CTGATACAGA CACCGTACAA GTAACTCTTA AGGATGGTCG GACTTTTGAG
GGGAAGGTAC TGGGAGTTGA CCAGATTACA GATGTAGCTG TTGTCAAAAT CCCTGGAAGA
AACTTGCCGA CAGTGAACTT GGGGAATTCG CAAAACCTCA TTCCAGGACA ATGGGCGATC
GCTATTGGCA ATCCTCTCGG TTTAGATAAT ACTGTCACTA TCGGCATTAT CAGCGCCACC
GACCGCACCA GCGCCCAAGT TGGAGTTCCC GATAAGCGAG TCAGCTTTAT TCAAACCGAT
GCAGCAATCA ACCCCGGTAA TTCTGGCGGG CCTTTATTAA ACGCTCAAGG GGAAGTAATT
GGCGTTAACA CTGCTATTCG TGCAGATGCT CAAGGTCTTG GCTTTGCCAT TCCCATAGAA
ACAGCTGCCC GTGTCGCTAA TGAGCTTTTT ACTAAGGGGA GTGTACAACA TCCGTTTTTA
GGGATTGAAA TGACAGACTT GTCCCCTAGC AAAAAACAGC AAATTAATAT TGAAAACAAG
TTAAATATTC GACAAGACAC TGGGGTGGTA ATTAAAGGTG TCTTGGATGA TTCTCCAGCC
AAAGAAGCAG GCTTGCTCCC TGGTGATGTG ATTCAAAAAA TTAACGGTAA AACAGTGAAA
ACATCAGCCC AGGTACAAAA ATCGGTGGAA TCCAGCACAG TTGGAGATAT TCTAACCGTC
GAAGTTAACC GCAGTGGTGA AATTCTCACC TTAAAGGTTC AGTCGGGAGT TTATCCCAAC
AGATAG
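
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks as shown here; `gc_content` is a hypothetical helper name, not something provided by the source database.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases into blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two blocks of the gene sequence, used only as a short demo.
print(gc_content("ATGAATTTAT CCCTAAAGCA"))
```

Passing the full 1206 bp sequence instead of the demo fragment gives the value to compare against the reported GC content.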
 
Protein sequence
MNLSLKQLAV YLSLLVVGGS AGLLGSRYLL PQNRSFQQLK NVTVGLPSES VASNPVIGSA 
ANNGGDNVNF IASAVQKVGP AVVRINATRK VANPISDVLK NPLLRRFFGE DEQPIPQERI
ERGTGSGFIL SEDGQLLTNA HVVADTDTVQ VTLKDGRTFE GKVLGVDQIT DVAVVKIPGR
NLPTVNLGNS QNLIPGQWAI AIGNPLGLDN TVTIGIISAT DRTSAQVGVP DKRVSFIQTD
AAINPGNSGG PLLNAQGEVI GVNTAIRADA QGLGFAIPIE TAARVANELF TKGSVQHPFL
GIEMTDLSPS KKQQINIENK LNIRQDTGVV IKGVLDDSPA KEAGLLPGDV IQKINGKTVK
TSAQVQKSVE SSTVGDILTV EVNRSGEILT LKVQSGVYPN R