Gene Ava_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0444 
Symbol 
ID3682449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp569295 
End bp570884 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content43% 
IMG OID637715773 
ProductXRE family transcriptional regulator 
Protein accessionYP_320965 
Protein GI75906669 
COG category[C] Energy production and conversion 
COG ID[COG1146] Ferredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.33149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTATA CAATTCCTAA CAACAGTTGC GTTGGATGTG ACAACTGCCG TCCCCAATGT 
CCTACGGGTG CAATCAAAAT CGAAAACAAT AAATACTGGA TAGATCCGTC TCTTTGTAAT
AATTGTGAGG GTTATTATGC TGAACCGCAG TGTGTGATAG CTTGTCCAGT AAAATCCCCC
ATACCGTGGC AAGCAAAGAA GGGGCGATGT AGAGTGGAAC CGCGAGATGC AACTAGCCCA
GACTTATTTT CTAATGGGAA GAATAACCCA TTTGCATCGG CGATCGCAAT TTGGGAAGCC
TGCAACCTAC TAGCGCAACG TACAGCCCTG AACTGGGAAA CTGACGAAGC AGGCTATCTC
TGCTACAGTC GAGCCGTCAA CCAAGGACGA GGAAAAATTA CCTTTTATAT CCAAGACCCG
TTCCAAGTCA GCGAAAAAGC CAAAAATGTA GCAGCCATTG AGGTATTTGA TATTAGGGCA
GCTTGTATCC ATTTAATTCT ATCTGCCCAC GCTACAGCCT TAGATAAACC TTGGGAACAG
GAGTTTACCA TTGACGAACG ACAGCTAGAA AAATATTTAG GGTTAGAGAA ACGCAAAGAC
CTCAGCAAAG CTGCCAAACT AGCCCTAATG AAAAATCTTG TGCAGCAAGC TTGCTCACTT
ATTATTTCTA TGGACTGGCC TCAGCAAGGG CAAGTCAAGG GGTTTTCAGT CACCAATAGC
CGCTTGTGGG ACTTAGTAAG TATTCAGCAC CACTTTCAAG AAGATAAATT GGGTTGTAAA
TATCTCGTTG GGCTAACATT TAAAGTCAAA GCCGGGACAT GGAGCCAATA TTTCTTAAAC
AAGCAAGGAT GCAAAGAACG CACTGCATTC TATCAATATG GCAGCCTCTC CAAGACTCTG
TTAACCACCG TTATGAGTAT TTGGCAGCAG CATGAAGGAG CCGTAAAATT AATGCTGTGG
TTATTATTTA AAACCAAAAT GGGCAAGGAA CAACGCATCA CTGTTCCTAC CTTACTACGC
ATTGCTTATG GTCAAGAAAA AGTCAACCTT GCTACTAGAC ATAGAGAAGA ACGCAAACGC
TTATTGCGGA CATTTGAAAA TGATTTAGAA GTACTAAATC ATTTAGGAAT TAAACCCATT
TTCGATCCCA TCACTTATCC TCCTGCTATC CAACCCCTAT GGGCAAAATT AGTAGATATT
CCTGAAGATC CCGACGAAGC ATTAGAGTTT TGGATTAATG ATGCAGGTGG TGAAACTCGC
CTCACCGATA GCGGCCCCCG TGGTAAGTGG AATCTGTTGA TGAATGCGCG GATTTTATCC
TTTGAATTAC CGCCAGAATG GGAAAGACAA ATTGCCGAAT CAGAAAAAAA ACAACGGCGA
ACTGCTAAGA GTAGACAAAA AATAAAAACC GCAGGTGATT TAGTTGGTGA GCAAATTTTG
CAAGCTCGCA AAAGTTTGAA TTTATCTCAA CGGGAATTAG CAAAGCTCAC TGGTAAAAGC
CAAAGCTGGA TTCGAGACAT AGAAAATGGT CGCTTAAAAG CGAAGCTAGA AGACCAAACA
CTTTTACGCA AAGTGTTACA CATTGCTTAA
 
Protein sequence
MPYTIPNNSC VGCDNCRPQC PTGAIKIENN KYWIDPSLCN NCEGYYAEPQ CVIACPVKSP 
IPWQAKKGRC RVEPRDATSP DLFSNGKNNP FASAIAIWEA CNLLAQRTAL NWETDEAGYL
CYSRAVNQGR GKITFYIQDP FQVSEKAKNV AAIEVFDIRA ACIHLILSAH ATALDKPWEQ
EFTIDERQLE KYLGLEKRKD LSKAAKLALM KNLVQQACSL IISMDWPQQG QVKGFSVTNS
RLWDLVSIQH HFQEDKLGCK YLVGLTFKVK AGTWSQYFLN KQGCKERTAF YQYGSLSKTL
LTTVMSIWQQ HEGAVKLMLW LLFKTKMGKE QRITVPTLLR IAYGQEKVNL ATRHREERKR
LLRTFENDLE VLNHLGIKPI FDPITYPPAI QPLWAKLVDI PEDPDEALEF WINDAGGETR
LTDSGPRGKW NLLMNARILS FELPPEWERQ IAESEKKQRR TAKSRQKIKT AGDLVGEQIL
QARKSLNLSQ RELAKLTGKS QSWIRDIENG RLKAKLEDQT LLRKVLHIA