Gene Ava_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0022 
Symbol 
ID3678867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp19248 
End bp20486 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content43% 
IMG OID637715349 
Productglycosyl transferase family protein 
Protein accessionYP_320543 
Protein GI75906247 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis
[COG2246] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.882894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCA ATAAAACTCA GTCATTGTTG CCAGTACCCG CAGGTAATTT ACAAGTTCCT 
GAGTTTCCAC CTAGTGATTC GGGTGTGACT GGTCAACCCA TCCAATTTTC TCTCATCATT
CCTACTTATA AAGAGAGCGG GAACATTAGG AATGTTGTGG AGAGATTAAG TCAGATACTG
GATGAGTTTA TACCAGGAGA TTATGAACTG ATTGTAGTAG ATGATGATAG CCCAGATGGC
ACTTGGGAAG TAGCCCTATC TTTGATGGCA GAATATCCAC AGTTACGAGT AATGCGACGG
CAGGAAGAAA GAGGACTATC TACAGCTGTA ATTCGTGGAT GGCAGGTAGC TAGAGGTAGT
ATTTTGGGAG TAATCGATGG AGATTTACAG CATCCCCCCC ATGTATTGCT GGAACTTTTG
CGGAAAATCC ATAAGGGTGC AGATTTAGCT GTAGCCAGTC GTCACGTAGA TGGGGGTGGT
GTAAGTAGCT GGAGTTTTAT CAGACGCTTT TTGTCTCGTG GGGCGCAATT ATTAGGACTA
GTGATTCTAC CTAGTGTATT GGGTAGGGTT TCCGACCCCA TGAGTGGTTA TTTTATGGTG
CGGCGTAACT GTATTACTAA TGCCACCTTC AATCCGGTAG GATACAAAAT TTTATTAGAA
GTGATTGGGC GGGGTCAAGT AGACGAAATT GCCGAAGTGG GTTATGTATT TTGTGAACGC
CAAGAAGGTG AGAGCAAAGT TACTTGGAAG CAGTATGTAG ATTACATCCA CCATTTAATT
CGCTTGCGAC TTTCCACCGG CAGGCTGCAA AGAATTCATC AAAGCTTTCC CTTTGATAAA
TTCATCCGTT TTGGTTTGGT AGGGTTGAGT GGTGTGTTTG TGGATATGGT GATACTGTAC
CTATTAAGTG ACCCATCAAC ACTAGCCTGG CCACTGACCC GCAGTAAAAT TATTGCCTCA
GAAATAGCAA TTTTCAACAA TTTTCTCTGG AATGATGCCT GGACTTTTGC AGATGTATCC
ATGCAGCAAC AGCATTGGCA TCAACGGTTG AAGCGATTTT TAAAATTTAA TATTGTTTGT
CTGGCCGGGG TAGTGCTGAA TGTACTGATA TTGAATATTA TCTTTAATTA TCTCATTCCT
AACCGCTATA TTGCCAACCT GATTGCGATC GCCATAGCCA CTGTTTGGAA CTTTTGGGTA
AACTTGCGAC TCAGTTGGCG CGTGACTCAA GTCAAATAA
 
Protein sequence
MSINKTQSLL PVPAGNLQVP EFPPSDSGVT GQPIQFSLII PTYKESGNIR NVVERLSQIL 
DEFIPGDYEL IVVDDDSPDG TWEVALSLMA EYPQLRVMRR QEERGLSTAV IRGWQVARGS
ILGVIDGDLQ HPPHVLLELL RKIHKGADLA VASRHVDGGG VSSWSFIRRF LSRGAQLLGL
VILPSVLGRV SDPMSGYFMV RRNCITNATF NPVGYKILLE VIGRGQVDEI AEVGYVFCER
QEGESKVTWK QYVDYIHHLI RLRLSTGRLQ RIHQSFPFDK FIRFGLVGLS GVFVDMVILY
LLSDPSTLAW PLTRSKIIAS EIAIFNNFLW NDAWTFADVS MQQQHWHQRL KRFLKFNIVC
LAGVVLNVLI LNIIFNYLIP NRYIANLIAI AIATVWNFWV NLRLSWRVTQ VK