Gene Ava_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0437 
Symbol 
ID3682598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp557186 
End bp558352 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content45% 
IMG OID637715766 
Productaromatic amino acid beta-eliminating lyase/threonine aldolase 
Protein accessionYP_320958 
Protein GI75906662 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00696357 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.223385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAATC GTCCTATATA TCTTGATTGT CACGCTACCA CACCTCTAGA TGAACAAGTA 
TTAGCAGCAA TGCTACCTTA CTTTACGGAA AAATTTGGCA ACCCGGCTAG TATCGGCCAT
ATTTATGGTT GGGAAGCAGA GGCGGCTGTG AAACAAGCAC GGGAAATTTT AGCAGCAGCA
ATTAACGCTA GTCCAGAAGA AATTGTCTTT ACTAGTGGGG CTACAGAAGC GAATAATTTA
GCGATTAAAG GTGTGGCTGA GGCTTATTTT CAAAAAGGTC AGCATATTAT CACTGTTGCC
ACTGAACATC ATGCTGTACT TGACCCTTGT GAATATTTAA AAACCCTTGG TTTTGAGATA
ACTATTTTGC CAGTCCAAGC AGATGGATTA ATTGATTTAG CCCAATTAGA AAAAGCGTTG
CGTCCTGAGA CAATTTTAGT ATCGGTGATG GCGGCTAATA ACGAAATTGG GGTGTTGCAA
CCTTTGGCAG AAATTGGGGA AATATGCCGC AGTCACAATA TAATTTTTCA CACAGATGCA
GCCCAAGCTA TTGGCAAAAT TCCTCTCGAT GTGCAAGCGA TGAACATCGA TTTGATGTCT
CTGACAGCCC ACAAAGTTTA CGGGCCTAAG GGTATTGGTG CATTATACGT CCGCAGGCGC
AATCCCAGAG TCCAACTCGC ACCCCAGCAG CATGGCGGCG GACATGAACG GGGAATGCGT
TCTGGGACGT TGTATACACC GCAAATTGTC GGTTTTGCCA AAGCTGTAGA AATCGCCCTC
GCAGAACAGA CAATGGAAAA TCAGCGCCTT ACCCAGCTAA GGGATAGATT GTGGTCACAA
CTTGCACAAT TGGAAGGAAT ACACCTCAAC GGACATCCCA CCCAGAGACT AGCCGGAAAC
TTGAATATCA GCATTGAAGG GGTGGACGGT GCTGCACTCC AGTTGGGTTT ACAGCCTGTT
GTGGCGGTGT CTTCTGGTTC TGCTTGTTCC TCCACCAAAA CTGCGCCCTC CCACGTCCTC
ACAGCTTTAG GAAGTCCAGA AAAACTAGCC TATGCTTCTA TTCGCTTCGG TATTGGACGC
TTTAATACAG CAGAAGAAAT TGATATAGTA GCGAAATATG CGATCGCTAC TATTCAAAGT
TTACGTAAAC AAGCAAGTTT GGTATAG
 
Protein sequence
MSNRPIYLDC HATTPLDEQV LAAMLPYFTE KFGNPASIGH IYGWEAEAAV KQAREILAAA 
INASPEEIVF TSGATEANNL AIKGVAEAYF QKGQHIITVA TEHHAVLDPC EYLKTLGFEI
TILPVQADGL IDLAQLEKAL RPETILVSVM AANNEIGVLQ PLAEIGEICR SHNIIFHTDA
AQAIGKIPLD VQAMNIDLMS LTAHKVYGPK GIGALYVRRR NPRVQLAPQQ HGGGHERGMR
SGTLYTPQIV GFAKAVEIAL AEQTMENQRL TQLRDRLWSQ LAQLEGIHLN GHPTQRLAGN
LNISIEGVDG AALQLGLQPV VAVSSGSACS STKTAPSHVL TALGSPEKLA YASIRFGIGR
FNTAEEIDIV AKYAIATIQS LRKQASLV