Gene Ava_4398 details


Gene Information

Locus tag: Ava_4398
Symbol:
ID: 3680525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5510609
End bp: 5512174
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 43%
IMG OID: 637719751
Product: pentapeptide repeat-containing protein
Protein accession: YP_324891
Protein GI: 75910595
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
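
The gene and protein lengths listed above follow from the start and end coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1566 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5510609
END_BP = 5512174


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1566 521
```

Both results match the Gene Length and Protein Length fields of this record.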


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0324397
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000237208
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATGTAG AGGAATTGCT GGCACAATAT GCGACTGGAG TCATCAATTT TAGTGGTGTT 
GACCTCTCGG AAGCTAATTT GAGTGGTGTC AAACTCTGTG GTGTGAATTT TAGCCAAGCA
AATTTGAGTA TAGCTAACCT GAGTGGATCA AATTTGAGCG AAGCTGACTT CAGCCATGCC
AAACTGAACG TAGCTAGACT GAGTGGTGCC AATCTCACTA ATGCTATTTT CAATCATTCC
AGCCTCAATG TTGCTAATTT AATTCGTTCT GACCTCAGTC GCGCTCAATT ACGTGGGGCT
TCCTTGGTAC GTGCTGAGTT AATTCGTGCG GAACTCAGTC GGGTTGATTT GTCGGAAGCA
AACCTCAACA GTGCTGATTT GCGAGAAGCT ACACTCCGCC ACGCCAATCT TCGCCACGCC
AATTTGAACG GAGCCAGCTT GAAAGGTGCA TCCTTGGTAG GAGCTAATTT GGAAATGGCT
AACCTTAATG GCTCTGACTT GAGTCGTTGT GATTTAACCA GCGCCAACCT GCGAGATGCT
GAACTAAAAC AGGTAAACTT CCGCCATGCT AACCTTAGTG GTGCAGATTT GAGCGGAGCG
AACCTCCGGT GGGCTGATTT GAGTGGTGTA AACCTGAGTT GGGCTGATTT AAGTAATGCA
AAATTGAGTG GTGCTAATTT AGTCGGCGCA GACTTGAGCA ATGCCAATTT AACAAATGCT
AGTTTAGTCC ATGCCAATTT AATTCAAGCA AAATTAATTA GAGCAGAATG GGTGGGTGCT
GATTTAACCA GCGCCATTTT GACTGGTGCA AAACTTTACT CTACTTCCAG ATTTGGCTTA
AAAACAGAGG GTTTGATTTG TCAATGGATT GACTTAAGTC CTACAGGCGA TCGCTCCATT
ATCCAACGGT TTGACACTGA AGATCCACGA GAATTTTTCA ACGAAACCCC ACCAACCATT
CAAATTGTTA TCGATGCAGC TTTAGAAGCA GAAGCGAACT TTGCCTTAGC TGGCGCTTAC
TACCACATCG CCCAAGAATA CTCCATCCTC AAACAACCCC CCAGCATGGA AATCAGTCGT
CGTCGGACTG TGTTTACATT CCGGGTAGAT AATGACGCGG ACTTATTACC TACAGCCTGT
ATGGCAATTT TACCTTTTAA AGATGCTGTT AATACCCAAA AGAATATTTA TGCCCTGTTA
GCCATGATGG AACAGGAAAA TATATCTTCC TTAGGAATAA AAACACCCCA TCGGGTGAAG
GAATTAACAA GTGCGATCGC GGAAGCTATA AGTCAGGCAC AGACCATGAA AAAAACCAAA
AAGAACCTGC ACTTAGCTGC AAAAATCGAA TTTTTTCAAG CTCCAACCCA AACCATATTA
ACTAATTCCA GCGCTCAAAC TTTGATTGTT CATGACAGCC CCAACTTTGG TAAGCGATTT
ATCAACTCCT CAATTACCGA AATGACTTTT TCATCCGATG TATCCGGTGA ATCTCTGCCA
CATAACTTAC CTGGGTTAAA CACACTTACA GATTTTGTTA ACAGTTTTCA CTATGTGAAT
GAATAA
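
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to strip the layout and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input, so no GC value for the full gene is claimed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from this record as sample input.
first_row = "ATGAATGTAG AGGAATTGCT GGCACAATAT GCGACTGGAG TCATCAATTT TAGTGGTGTT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence through the same two functions gives a value that can be compared with the listed GC content.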
 
Protein sequence
MNVEELLAQY ATGVINFSGV DLSEANLSGV KLCGVNFSQA NLSIANLSGS NLSEADFSHA 
KLNVARLSGA NLTNAIFNHS SLNVANLIRS DLSRAQLRGA SLVRAELIRA ELSRVDLSEA
NLNSADLREA TLRHANLRHA NLNGASLKGA SLVGANLEMA NLNGSDLSRC DLTSANLRDA
ELKQVNFRHA NLSGADLSGA NLRWADLSGV NLSWADLSNA KLSGANLVGA DLSNANLTNA
SLVHANLIQA KLIRAEWVGA DLTSAILTGA KLYSTSRFGL KTEGLICQWI DLSPTGDRSI
IQRFDTEDPR EFFNETPPTI QIVIDAALEA EANFALAGAY YHIAQEYSIL KQPPSMEISR
RRTVFTFRVD NDADLLPTAC MAILPFKDAV NTQKNIYALL AMMEQENISS LGIKTPHRVK
ELTSAIAEAI SQAQTMKKTK KNLHLAAKIE FFQAPTQTIL TNSSAQTLIV HDSPNFGKRF
INSSITEMTF SSDVSGESLP HNLPGLNTLT DFVNSFHYVN E