Gene Ava_2389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2389 
Symbol 
ID3683232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2968932 
End bp2970722 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content32% 
IMG OID637717735 
Productglycosyl transferase family protein 
Protein accessionYP_322902 
Protein GI75908606 
COG category[S] Function unknown 
COG ID[COG4627] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.062581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000855128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGTTA GTGTTCTAGT TATAACCTAC AATCATAGCA GATTTATTGC ACAGGCAATA 
GAAAGCGTGT TGATGCAAAA AGTTAACTTC GAGTATGAAA TTGTTGTAGG GGAAGATTGC
TCTACAGATG ATACACGCAA AATTTTAATT GATTATCAAC AAAAATATGC AGATAAAATT
CGCTTATTGT TACCTGAAAA AAACTTAGGT ATGCACAGGA ATTTTGTTAA TACTTTGCAA
GCCTGTTGCG GTGAATATGT AGCAATTTTG GAGGGAGATG ATTACTGGAT TGCAGAGGAT
AAACTACAAA AACAAGTAGA CTTTTTAGAT AAAAATTTAG AATTTACGAT TTGTTTTCAT
AATGTAATTA TTTTTCATGA GGATAATCAA TACCAACCCT ACCTATTTCT TCATAATCAA
CCTCCAGTCT CCTATATAGA AGATTTATTA ATACGCAATT TTATTTCCAC ACCCTCTGTA
ATGTATCGCG CTGGATTAGT AGATAATATA CCTAATTGGT TCTATGAACA AGGCATGGGC
GATTGGATTT TCCACATTCT TAATGCACAA TATGGAAAAA TTGGATACAT TGATAAAGTA
ATGTCAGCTT ATAGAATTCA TGCAGAAGGA GTCTGGTCAA GTAAGAACAG AGATTGGCAA
CTAAATAAAA CCATTAAAAT GCTAGATACT GTTAAATCTA ATCTAGATGG GCAGTATGAA
GAAATTATAG ATAGAGCAAT TCAATTTTAT TCGCAGCATT TATTGAAGTT AAATTCTATA
CATCAAGGAA CCACTTCTAA ATTCACTACA AATCTACCCA AACATTTGGA ATTAAAAGAT
ATAAATTTTA TAGTCTTTCC AGATTGGAAT CAAGAAGAGG AACATCTATA TCAAAATCTA
TATTGTCTTG TTAGATTACT CTTAACGCAT CCAGAAAATA ATAGAATTGC TTTATTTATA
GATACAACCA AGATTAGCGA AGAAGATGCT AATTTATGTA TATCTAGTAT TTTAAGTAAT
CTGATATCAG AAGAACAATT AGAAAAAAAT AAAAATATTA ATGTATTTTT AGTCGGTAAA
CTTGACCAAA TTCAGTGGCG AACTCTTACA TCTTTCCTGA AAGCTAGAAT TGTATTAGAT
TCAGAAAATT CAGATGCACT TCTTACTTCT GGAACAGAAA TTATTCCAGC CTATACAGTA
AATCAACTTA TTGAATTTGG CTTAATATTT ATTGGCTCTG CAAAACCTTA TAAATTAAAT
ATTGGCTGTG GCAATGTCAG ATTTGATGGC TGGATAAATA TAGATATAGA GCAAAATTAT
AAAACAGTGG ATTTAGTCTG TGATGCTAGA CAGAAACTAC CATTCGATGA TAATTCTTGT
GAATTAATAT ATAACGAGCA TTTTCTGGAA CACTTAACCT TGAAAGAAGG ATTATTTTTT
TTGAAGGAGT GCTATCGTAT CTTAAAGCCC GAAGGGATTT TACGCATAGC AATGCCGTCC
TTAGAATATG TTGTACAAAA ATATATGTCT GACGATTGGC GTAATCAAGA GTGGCTCAAA
TATCCAGAGT ACCAATTTAT TAAAACACGT GCTGAGATGC TAAATATTTC TATGCGTTGG
TGGGGTCATC AGTGGCTTTA TGATACAGAA GAGTTATATC GCAGATTAAC TGAATCAGGA
TATATAAATA TTCAAGAGTT TGCATGGGGC AAGAGTAATA CACCGGAGTT AAGAAATCGT
GAGACTAGAG TTGATTCAAT ACTGATATGT GAATCACAAG TGCTGAAATA A
 
Protein sequence
MKVSVLVITY NHSRFIAQAI ESVLMQKVNF EYEIVVGEDC STDDTRKILI DYQQKYADKI 
RLLLPEKNLG MHRNFVNTLQ ACCGEYVAIL EGDDYWIAED KLQKQVDFLD KNLEFTICFH
NVIIFHEDNQ YQPYLFLHNQ PPVSYIEDLL IRNFISTPSV MYRAGLVDNI PNWFYEQGMG
DWIFHILNAQ YGKIGYIDKV MSAYRIHAEG VWSSKNRDWQ LNKTIKMLDT VKSNLDGQYE
EIIDRAIQFY SQHLLKLNSI HQGTTSKFTT NLPKHLELKD INFIVFPDWN QEEEHLYQNL
YCLVRLLLTH PENNRIALFI DTTKISEEDA NLCISSILSN LISEEQLEKN KNINVFLVGK
LDQIQWRTLT SFLKARIVLD SENSDALLTS GTEIIPAYTV NQLIEFGLIF IGSAKPYKLN
IGCGNVRFDG WINIDIEQNY KTVDLVCDAR QKLPFDDNSC ELIYNEHFLE HLTLKEGLFF
LKECYRILKP EGILRIAMPS LEYVVQKYMS DDWRNQEWLK YPEYQFIKTR AEMLNISMRW
WGHQWLYDTE ELYRRLTESG YINIQEFAWG KSNTPELRNR ETRVDSILIC ESQVLK