Gene Ava_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3670 
Symbol 
ID3679226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4576335 
End bp4577846 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content41% 
IMG OID637719021 
ProductNHL repeat-containing protein 
Protein accessionYP_324171 
Protein GI75909875 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0526] Thiol-disulfide isomerase and thioredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCCC GTGTAAGAGC GCCAGAACTA CCGCAAAATT ATCCTTGGTT GAATACTGAA 
CAACCTTTGT CTATTAAGCA ACTTAGAGGT AGAGTTGTAA TTTTAGATTT TTGGACTTAT
TGTTGTATAA ACTGTCTCCA CGTTCTGCCA GATTTGAAAT ATCTGGAACA AAAATACAAG
GATAGCCTGA CAGTTATTGG TGTCCATTCT GCCAAATTCG ACAACGAACA GGAAACCGAA
AACATCCGCC AAGCTATATT GCGCTACGAT ATTGAACACC CAGTATTAGT AGACAAAGGT
TTTCGTGTAT GGCAAGAGTA TGCTGTACGT GCTTGGCCTA CTTTAATGGT TATCGACCCC
AAAGGTTATG TAATTGGCTA CGTTTCTGGT GAAGGAAATC GAGATAAGTT AGACCAATTA
ATTACACAAG TCATTCAGGA ACATCAAGGC GCAATTAATT TCCAACAACT CAGCCTCACT
CTAGAAAAAC AGCGTCAACC ATTAATTACA CCTCTAGCTT TTCCAGGTAA AGTTCTAGCC
ACCCCAGGCG GGTTATTCGT CGCCGACTCC GGACATCACC GCATAGTTGT GAGTGACTTC
AACGGTGAGA TTCTGCATTT AATCGGTAAC GGAAAGTCAG GCTTAACTGA TGGTAATTTT
CAGGAAGCAC AGTTTTCCGC ACCCCAGGGA ATGGCGTTTG ATATGGAAAA TCAAATTCTC
TACGTCGCTG ACACAGATAA TCATGTTGTG AGACGGGCTG ATATTCAGCA GCAAACAGTA
GAAACCATTG CAGGGACAGG TGAACAAAGC CGCAATATTC AACCGCATGG GGGTGCTGGT
TTAGAGACTG CTTTAAACTC CCCTTGGGAT TTGGTAAAAG TTGGAAATAG TTTATACATT
GCAATGGCAG GAACCCATCA AATTTGGCAA ATGGATTTAC CAAGCGGCTT TGTCAAAACC
TATGCAGGTA CAGGTGCAGA AGGTTGTTTT GATGGTTACC TGACAGAATC AGTTTTTGCC
CAACCTAGTG GAATTACTAA TAATGAACAA GAATTATACA TTGCTGACAG TGAAATCAGT
TCCATTCGTG GTGTAGGACT ATTGGAACCT CAGGAAGTCA GAACCGTTTG CGGTAGTGGT
GGTTTATTCG GTTTTGGTGA TGTCGATGGA CAGGGTGAAA ATGTCCGTTT ACAGCATTGT
TTAGGGGTGG AATATTTTCA AAATTATTTG TGGGTAGCAG ATACATACAA CCACAAAATT
AAATTAGTTA GTCCTCATAC TGGTAATTGT CAAACTGTCC TGGGAGATGG TTCGGCTGGG
TTACAAAATG GTCAAGGTAA AAATACGCGC TTTTTTGAAC CTTCCGGCTT GAGTGCGATG
GATTCATATC TGTATATTAG TGATACGAAC AATCATGTAA TTCGCCGTGT AGATTTGCGT
ACTTTGGAAG TGACGACGAT GCAATTTAAT GGTTTATGTG CGCCTGATGT TTGTATTCCA
AATAACTTTT AA
 
Protein sequence
MIPRVRAPEL PQNYPWLNTE QPLSIKQLRG RVVILDFWTY CCINCLHVLP DLKYLEQKYK 
DSLTVIGVHS AKFDNEQETE NIRQAILRYD IEHPVLVDKG FRVWQEYAVR AWPTLMVIDP
KGYVIGYVSG EGNRDKLDQL ITQVIQEHQG AINFQQLSLT LEKQRQPLIT PLAFPGKVLA
TPGGLFVADS GHHRIVVSDF NGEILHLIGN GKSGLTDGNF QEAQFSAPQG MAFDMENQIL
YVADTDNHVV RRADIQQQTV ETIAGTGEQS RNIQPHGGAG LETALNSPWD LVKVGNSLYI
AMAGTHQIWQ MDLPSGFVKT YAGTGAEGCF DGYLTESVFA QPSGITNNEQ ELYIADSEIS
SIRGVGLLEP QEVRTVCGSG GLFGFGDVDG QGENVRLQHC LGVEYFQNYL WVADTYNHKI
KLVSPHTGNC QTVLGDGSAG LQNGQGKNTR FFEPSGLSAM DSYLYISDTN NHVIRRVDLR
TLEVTTMQFN GLCAPDVCIP NNF