Gene Ava_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2100 
Symbol 
ID3680496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2593832 
End bp2594977 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content43% 
IMG OID637717445 
Productglycosyl transferase, group 1 
Protein accessionYP_322617 
Protein GI75908321 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00100338 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCATTGA AATATGCTCT TGTTCATGAG TGGCTGACAC CAAAAGCCAC CGGCGGTTCA 
GAACTCGTTG TCAAAGAAAT TCTGCATCAT ATCGATGCCG ATTTATACGC TCTCATCGAC
TTTGAGTCTA ACAATCCTGA AAGTTATTTG TACCAACGTC AGATTGGCAA GACCTTCCTC
CAGCACTTTC CCTCGGCCCG TAATGGTATC CAAAAGTATC TACCATTTTT ACCCCTGGCA
ATCGAACAAT TGGACTTGCG CCAGTATGAC GTAATCTTGT CCTCATCTCA TGCTGTTGCC
AAAGGCGTAT TAACCACTGC TGACCAGTTA CATATTTGCT ATTGCCATAG TCCCATGCGC
TACGCCTGGG ACTTAACCTT TGATTATTTG CGTTCCAGCC CAATGGGCAG TGGTGTTGCG
GGATGGGTGA CTCGATACTT ATTACATCGT TTACGCCAAT GGGATGTGTT AAGCGCCAAT
CGAGTAGATT ACTTCATTGC CAATTCCCAT TACACAGCTA GGCGTATATG GCGCTGCTAT
CGGCGAGAAG CCAAAGTTAT TTATCCGCCG GTGAATGTCG CAGAATTTCC ATTTTTACCT
CACAAAGAGG ATTTTTATCT CACAGTTTGC CGATTGGTGA GTTATAAACA GGTATCCCTA
ATTGTGAAAG CGTTTAACCA ATTGCAACGG CCATTAGTCA TCATTGGTAC AGGTTCAGAA
ATGAAACAGA TTCGCCAGCT AGCTAATTCT AATATCCAAA TATTAGGTTG GCAACCTGAT
GATGTAGTCA AAAAGTATAT GGCCAAAGCC AAGGCTTTTG TCTATGCTGC CTGTGAAGAT
TTTGGTATAG CTTTAGTAGA AGCACAAGCT TGTGGTACTC CGGTAATTGC CTATGGTATC
GGAGGTGCCA CGGAAACAGT TAGGGATGTA CGATCTTATA AAGATACAGG AACAGGTATA
TTTTTTAAAA TGCAAACTCA AGCAGCTTTG GTGGAGGCAG TAGAAAAATT TGAAATGTAT
CAAGATGCTC TTGACCCTGA GTATATGCGA TCGCACGCTG CTGAGTTTTC CCCGCAAAAC
TTTGCCAAGC GCTATCTAGA TTTTTTAGAC CAGTGCCATC AACAAAAGCC TAATTTAGCA
GGTTAG
 
Protein sequence
MPLKYALVHE WLTPKATGGS ELVVKEILHH IDADLYALID FESNNPESYL YQRQIGKTFL 
QHFPSARNGI QKYLPFLPLA IEQLDLRQYD VILSSSHAVA KGVLTTADQL HICYCHSPMR
YAWDLTFDYL RSSPMGSGVA GWVTRYLLHR LRQWDVLSAN RVDYFIANSH YTARRIWRCY
RREAKVIYPP VNVAEFPFLP HKEDFYLTVC RLVSYKQVSL IVKAFNQLQR PLVIIGTGSE
MKQIRQLANS NIQILGWQPD DVVKKYMAKA KAFVYAACED FGIALVEAQA CGTPVIAYGI
GGATETVRDV RSYKDTGTGI FFKMQTQAAL VEAVEKFEMY QDALDPEYMR SHAAEFSPQN
FAKRYLDFLD QCHQQKPNLA G