Gene Ava_C0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0041 
Symbol 
ID3678117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp61579 
End bp63408 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content45% 
IMG OID637715125 
Producthypothetical protein 
Protein accessionYP_320319 
Protein GI75812702 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.616991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.873724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACAGC TAGAGAATGA GGCTTCCAAT AAACATATAT CTTCAGTAAA TAGTCTTTTA 
CCAATCGATA GGGCAGAAGA AGATGAAGTT GAAAATTTGC AATTTGAGGT GGTTGAAGAA
GATGAAACTG AAGTAGAAGA CGCCGCGTTA ATCCAAACTA AACACGACTT TGTTACATCT
CCCTGGTCAA GATTAGGAAT TATTGGTGGT GCATTTGGTG CTGGGTTTCT AGTTATATTT
GTTGTCCTCA ACGGCATGAT GAATGGGGGT GGTAAAAATG CCAAAAAACC AGAATTAACT
ACTACTCCAA CTCCCACGGT TGCCGCATCT GAGAAAAAAG AAGGCGATGT TTATGCCAAG
CTTGCTCTCG CCAAGCAACA GGAAGAACTT GATGCCCTAA ATGGTAAAGC TAACACAGAA
GACAAAGAGG AGAAAAAAGA AGCTACTGAA GAAGAACAAG CCAAAAACCA AGAGAGAACC
CGTTCTATTC GACAACGAAG GGTAGTTGCT CAAGAAACTC CCCCTACTTC CAGAAGACGT
GTATACCGGG AAGAAACACC CGAACCTGTT AGACCAACAC GTAGAGTAGT GGCTTCACAG
CCAATACCAC CTCTGCGTCC TAGCGCTCCT CTGCCAAAAT TTGCCAAACA AACAGTTGCC
AGCAACCAGA GTGTCGCCAA AGATCCAATA GCCGAACTCG AACGCCTGCG TAACCTGGGT
TCGGTTGGTC GAGTTGAGTA TATGCTAGCC AGCACTACTA TCAGCGAACC TACAAACACA
ACAGTACCAG AAGTTACAGC TAAAACTGAA GAAACATTAC GCCCAGCAGA ACAAAATAGC
GATCGCTCCC GTCGTCGCCG TAGCCGACGC AGCGAGAATA CCGAAACAAT TACCAATACC
CCTAATCAAG TTGAAGAACT GCGCCCGCGT TGGCAACCTG TAACTACAAA CAACGGTACT
CCAGAGTATA GCAATACTGA TGACAAAGAT AACAATCAAG CCGTGCCAGT TATTTATAGC
TTCTCAGTAA ACGAGCCACA AATTAGCTTG GAACTTGACA AAGAAAAGAC CGTAGAATCC
AGTTTTGCTA GTAACAAAGA AAACAACTAC ACTCAGCAAA TCGAACAAGT TGCAAATAAC
TACCTGCCAG AAGAAGCGCA GATACTCCAA GAAACACAAC CGCAGTATTT AGTTGTCGGT
TCGTTTGCAA GTGCAACCCT GGTAACTCCG CTCGTTATGC CTCAAACCAG TAATAACGCT
CGTTCCCAAG AACAGACAAA TACATTACGG TTTGTCGCCC GGTTGAACGA GCCGCTTTAT
AGCAATACTG GGGAAATAGC CATTCCCGCC GGAACGCAGG TAACTATCGC TATGATTTCG
GTTGATAGCA CATCAGGTGT GCGTGCCGAA GTCGCGGCTA TTCTTAAAGA CGGCACAGAA
TATCCTCTAT CACCTGGAAC AATATCAGTT TTGGGAGAGG CGGGTTCACC CCTAATCGCC
AGACCTTACG ACGACAAGGG ATCTGAAATT GCCAAATATG ATGCCACATT AGGGACAATA
GCTGGTCTTG CCAAGGTAGG GGAAATTATT AACCAGCCGG ATGAAGAAAT CACAGAAGAT
TTGCCTTTAG GTGGGACAAG AACCCGCAGC CGCAATAACA ATCGCAACCT GGGAGCTGCT
TTCCTTGAAG GTGCTTTTGG CAAACTTGGC GAAACACTCA GCAAGCGGAC AGAACGTGCT
ACTGACGAAA TCAACCGCCG TCCCAATGTT TGGTATGTGC CTAAAGACAC CAAAATCACA
ATCAAAGTCG ATAGGTCAAT CAAGCTGTGA
 
Protein sequence
MLQLENEASN KHISSVNSLL PIDRAEEDEV ENLQFEVVEE DETEVEDAAL IQTKHDFVTS 
PWSRLGIIGG AFGAGFLVIF VVLNGMMNGG GKNAKKPELT TTPTPTVAAS EKKEGDVYAK
LALAKQQEEL DALNGKANTE DKEEKKEATE EEQAKNQERT RSIRQRRVVA QETPPTSRRR
VYREETPEPV RPTRRVVASQ PIPPLRPSAP LPKFAKQTVA SNQSVAKDPI AELERLRNLG
SVGRVEYMLA STTISEPTNT TVPEVTAKTE ETLRPAEQNS DRSRRRRSRR SENTETITNT
PNQVEELRPR WQPVTTNNGT PEYSNTDDKD NNQAVPVIYS FSVNEPQISL ELDKEKTVES
SFASNKENNY TQQIEQVANN YLPEEAQILQ ETQPQYLVVG SFASATLVTP LVMPQTSNNA
RSQEQTNTLR FVARLNEPLY SNTGEIAIPA GTQVTIAMIS VDSTSGVRAE VAAILKDGTE
YPLSPGTISV LGEAGSPLIA RPYDDKGSEI AKYDATLGTI AGLAKVGEII NQPDEEITED
LPLGGTRTRS RNNNRNLGAA FLEGAFGKLG ETLSKRTERA TDEINRRPNV WYVPKDTKIT
IKVDRSIKL