Gene Ava_C0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0043 
Symbol 
ID3678119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp64262 
End bp65674 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content45% 
IMG OID637715127 
Producthypothetical protein 
Protein accessionYP_320321 
Protein GI75812704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0397934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.5668 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAACG AAAATCTCAC CTCTAACAAC CAAATTTTAG ATACTACAGC AGAAGTAACT 
GCGATTGTAC CAGTGAGAGC TGCAAGTCAG GAAACCACTC ACAACAAAAC CAATCGAGGG
CTTGAGTATC GGATAGAAAA GTTTAATAAA TTACGCTACC TAATGGTTGC AGGACTTATT
GCTGGGCTTG CCCTACATGG GTTAATCCGA TATGGAAGTA GCTATAATAT CGGCAGTCTC
TTATTCGGCG ATAAAGCGGT AGCTCAAACA GTTTCATCGA AGTATTCAAA TTTGAACACT
TCGATAAATA AGACAAAACA AACCAAACAG CCACAGCCGG TTGCAACAAA ATACGTCAAT
GGCGCACCTA CTCCCGATTG GAGCAGTATC ACGTTTAGAA ATATGAAATT TGGCGGTGCT
GGCAGCGTTG AATTCCCCAA TATCAAAAAC CCTGGTATGA AGGAGACACG CACCTGGAGT
GCTGGACAAA GCCTTGCTGA AGTCATGGAA CTGGGTGATT TTGAGGCAAC TGAGTTCAAT
ATTGAAGATT TCACGCTTAA AGATATCGCT GACATCACGG GCATTCAACT CAAAAATCTC
AAACTGTCAG ATTTTGAAAC ACTCGATTGG CAAACACTCG AAGACCTCGC CGAAGCAATT
CCTAATTTAG AAGATTCAGA AATCGCAGCC ATCCCACCAA TTCAAGAACT AGTGACCAAA
GTTACAGGCA GCACCGCAAG CTACCAAACT GTTGGTGAGG TGCTGGACAA TTATCCCCAA
CTGGGAGAAG TGGAACTTGG TGAGTATATC AAACTTGATA AATATAAACT CACCAGTATC
CCTGGTATTG AGGATGCACA GCTAAAAGAC TTTACTAACT GGCAAAACAC AACAATTGGT
AAGGTTCCGG GATTAGCTGA TGTGCCGTTT GATCGGTTCC CCAGTGTTCC CATCCCCGAT
GTTAGCTTTA TCGGCAAAGT AGACTTACCC CTCGGTTCGC TTGAAAGTGG ACGCTGGAAG
TCAATAAGTG GTAGCTATCA AGCTGGGTTT AATGTCCCGT GCGAGAAAAC CTGTGGTCAC
ATTGAAGTTT CGGGTAGCGA TATTGTGACT GGCGCACAAT GGATGTCCGG CAAAGACCAA
AAAGTTAAAG GCGGATTTGG CATCTTGGGC AGTCTCAACG GTGGAAAAGA ACCCACAGGT
CGCCACCCCT TTGGAAAATC ATTTAAACAA GTCATCTGGG ATATTAACGA GGCGGACGGA
AGTATCACCA CCGCGATGTT CTTCCGTATC TGCAAGCGGG GGTGGGTTGA CTTAGGCTGC
TCGCCCTACT TCATTGGCCC TGTTCCCTTC TTCACCTACC GCGAGATAGA CCCAATTATC
CTTGGTACGC CCCTCACCGT ACCTAAAAAT TAA
 
Protein sequence
MYNENLTSNN QILDTTAEVT AIVPVRAASQ ETTHNKTNRG LEYRIEKFNK LRYLMVAGLI 
AGLALHGLIR YGSSYNIGSL LFGDKAVAQT VSSKYSNLNT SINKTKQTKQ PQPVATKYVN
GAPTPDWSSI TFRNMKFGGA GSVEFPNIKN PGMKETRTWS AGQSLAEVME LGDFEATEFN
IEDFTLKDIA DITGIQLKNL KLSDFETLDW QTLEDLAEAI PNLEDSEIAA IPPIQELVTK
VTGSTASYQT VGEVLDNYPQ LGEVELGEYI KLDKYKLTSI PGIEDAQLKD FTNWQNTTIG
KVPGLADVPF DRFPSVPIPD VSFIGKVDLP LGSLESGRWK SISGSYQAGF NVPCEKTCGH
IEVSGSDIVT GAQWMSGKDQ KVKGGFGILG SLNGGKEPTG RHPFGKSFKQ VIWDINEADG
SITTAMFFRI CKRGWVDLGC SPYFIGPVPF FTYREIDPII LGTPLTVPKN