Gene Ava_1164 details

Gene Information

Locus tag: Ava_1164
Symbol:
ID: 3683360
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1426867
End bp: 1428117
Gene length: 1251 bp
Protein length: 416 aa
Translation table: 11
GC content: 44%
IMG OID: 637716501
Product: RNA polymerase sigma factor SigC
Protein accession: YP_321683
Protein GI: 75907387
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
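
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no amino acid. The variable names are hypothetical and are not part of the database record.

```python
# Values copied from the gene record.
start_bp = 1426867
end_bp = 1428117
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons, remainder = divmod(span_bp, 3)
expected_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", expected_aa == protein_length_aa)
```

Under these assumptions, the three fields (start, end, gene length) and the protein length agree with one another.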


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.115543
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCAA CATCTTTTTA CGCAGATGCC GCCTACAATA CCCAAAAATC CCGCCAGGCT 
TTAGACCCTG ATATTGCCAT TGATGACAGT GATTTGTCGG TGGATGAGAT CCAAGAATTG
GAGATAGCTG CTGCTGATCC CGCTACTTTT GGTGCTAGCG CTAACCGTCG TAGTACAGAC
TTAGTACGTC TATACCTTCA GGAAATCGGT CGAGTCCGTT TGTTAGGACG GGACGAAGAA
GTTTCTGAAG CTCAAAAAGT CCAGCGTTAC TTAAAGTTGC GGGTAGTGCT AGCTAATGCT
GCCAAACAGG GAGATGAAGT TGCTACTCCT TATCTGCATT TAATTGAAGT TCAAGAGCGT
CTAGCATCAG AACTTGGCCA TCGTCCTTCC TTGGAAAGAT GGGCTGCTAC TGCTGGTATC
AACCTATGTG ACCTGAAGCC AATTTTATCG GAAGGTAAAC GTCGCTGGGC AGAAATTGCC
AAGATGACGG TGGAAGAATT GGAGAAAATG CAATCTCAAG GTCTTCAGTC AAAAGAACAC
ATGATTAAGG CTAATTTGCG CTTAGTTGTG TCTGTGGCTA AAAAATATCA AAATCGTGGT
TTAGAATTAT TAGATTTAGT TCAAGAAGGC ACTCTCGGCT TGGAACGAGC TGTAGAGAAA
TTTGATCCAA CTAAGGGATA TCGTTTTAGT ACCTATGCCT ACTGGTGGAT TCGCCAAGGG
ATTACAAGAG CGATCGCTAC TTCTAGTCGT ACGATTCGCC TCCCTGTTCA TATTACAGAA
AAATTAAACA AAATTAAAAA GGCTCAACGC AAAATCGCTC AAGAAAAAGG TCGCACTCCT
ACTTTAGAAG ACCTAGCAAT TGAGTTAGAT ATGACACCTA CCCAAGTGCG GGAAGTTTTA
TTAAGAGTAC CTCGTTCTGT TTCTTTAGAA ACCAAGGTAG GTAAAGATAA AGACACTGAG
TTGGGAGAAC TATTAGAGAC TGACGGCGTA ACTCCTGAAG AAATGTTAAT GCGGGAATCT
CTACAAAGAG ACTTGCAACA TTTATTAGCT GATTTAACTA GCCGTGAACG TGATGTCATC
TTGATGCGTT TTGGTTTAGC TGATGGTCAT CCTTACTCAT TAGCAGAAAT TGGCCGCGCT
CTGGATTTAT CACGGGAGCG AGTACGCCAA ATCGAATCTA AGGCCTTGCA AAAGCTCCGC
CAACCGAAGC GCCGCAACCT TATCCGTGAC TATTTGGAAT CTCTGAGTTA G
 
Protein sequence
MPATSFYADA AYNTQKSRQA LDPDIAIDDS DLSVDEIQEL EIAAADPATF GASANRRSTD 
LVRLYLQEIG RVRLLGRDEE VSEAQKVQRY LKLRVVLANA AKQGDEVATP YLHLIEVQER
LASELGHRPS LERWAATAGI NLCDLKPILS EGKRRWAEIA KMTVEELEKM QSQGLQSKEH
MIKANLRLVV SVAKKYQNRG LELLDLVQEG TLGLERAVEK FDPTKGYRFS TYAYWWIRQG
ITRAIATSSR TIRLPVHITE KLNKIKKAQR KIAQEKGRTP TLEDLAIELD MTPTQVREVL
LRVPRSVSLE TKVGKDKDTE LGELLETDGV TPEEMLMRES LQRDLQHLLA DLTSRERDVI
LMRFGLADGH PYSLAEIGRA LDLSRERVRQ IESKALQKLR QPKRRNLIRD YLESLS
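
The sequences above are displayed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a plain GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and it is an assumption that the GC content field in the record was computed as a simple G+C fraction of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGCCAGCAA CATCTTTTTA CGCAGATGCC GCCTACAATA CCCAAAAATC CCGCCAGGCT"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases after joining the blocks
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Pasting the full gene sequence in place of `first_line` gives values that can be compared with the Gene length and GC content fields of the record.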