Gene Ava_1139 details


Gene Information

Locus tag: Ava_1139
Symbol:
ID: 3683393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1394725
End bp: 1395825
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 44%
IMG OID: 637716475
Product: glycosyl transferase, group 1
Protein accession: YP_321658
Protein GI: 75907362
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
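
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1394725, 1395825)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```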


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0404734
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.023827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATAC TACATATTTT GAATCACGTC CGTGAAATTG GTAATGGTAT CGTTAACGTC 
GCTGTAGATT TAGCTTGTCT GCAAGCCAAA AATAATCATG ATGTAGCTGT AGCATCTGCT
GGCGGAGAAT ATGAAACATT ACTGGCTGAA TATGGTGTTA AACACTTTGA GTTAAACCAA
AGCCGCACAC CCCTAAATTT AATTCATGCG GCTAGGCGTT ATCGAGCCAT TATCCGAGAA
TTTCAACCGG ATATAGTCCA TGCTCATATG ATGACGGGAG TTGTGCTGGG AAGATGTCTC
AAAGCTGATT CTGAGTATGC TTTGGTGGCG ACAGTACACA ATGAATTTCA ACGTAGTGCA
GTACTCATGG GATTAGCAGA TCGGGTAATT GCGGTGAGTC ATGCTGTCGC TAAATCAATG
GCGAGACGTG GCATTCCTGA GCATAAATTG CGGGTGATAT CCAACGGCAC ATTAGGCAGC
GTCCGTACTC GTAATATTCA AGATTACTCA CCAGTAGAAT TACAACGACC AGCGATCGCT
ACTGTAGCAG GGATGTATCA ACGTAAAGGC ATTGGTGAGT TAATCGCGGC TTTTGCACAG
ATTGCCCAAG ATTTCCCCCA AGCCCATCTT TATCTCGTTG GAGAAGGCCC CGAAAGGCAG
ATTTTTGAGG AAAAAGCCCA AGCTACAGGT TTGAGCGATA CACGCATTCA TTTTGTAGGT
TTCCAACCAG AACCACAACG CTACTTATTA GCCGCAGATA TATTTGTACT TGCTTCTCAC
CGTGATCCTT CTCCCCTAGT AATTCCAGAA GCCCGTGAAG CCGGCTGTGC TATTGTCGCT
ACTAATGTAG ATGGCATTCC CGAAGCATTA GACAATGGCA AGGCTGGTGT TTTGGTTCCC
CCAAAAGATA GTTCTGCTTT GGCAGATGCC CTAGCAAAAT TACTTAGCCA ACCTCATCTG
CTCAAGCATT GGCAAGACCA AGCGCAACAA AATTTAGAAT TGCTGACAGT TGGGCGTGTG
AATCAACAAA CTTTGGCAGT TTATGCTGAG GTTAAAATAG GATTTAGAAA CGTAATTTGC
CATACCCTAA AAAAAAGCTA G
 
Protein sequence
MRILHILNHV REIGNGIVNV AVDLACLQAK NNHDVAVASA GGEYETLLAE YGVKHFELNQ 
SRTPLNLIHA ARRYRAIIRE FQPDIVHAHM MTGVVLGRCL KADSEYALVA TVHNEFQRSA
VLMGLADRVI AVSHAVAKSM ARRGIPEHKL RVISNGTLGS VRTRNIQDYS PVELQRPAIA
TVAGMYQRKG IGELIAAFAQ IAQDFPQAHL YLVGEGPERQ IFEEKAQATG LSDTRIHFVG
FQPEPQRYLL AADIFVLASH RDPSPLVIPE AREAGCAIVA TNVDGIPEAL DNGKAGVLVP
PKDSSALADA LAKLLSQPHL LKHWQDQAQQ NLELLTVGRV NQQTLAVYAE VKIGFRNVIC
HTLKKS
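
The relationship between the gene sequence and the protein sequence above can be reproduced with a short sketch. It builds the amino-acid assignments of translation table 11 (which match the standard genetic code for codon-to-amino-acid mapping), strips the spacing used in the 10-base blocks, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_content` are illustrative and do not come from the source; only the first three blocks of the gene sequence are used in the example.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of translation table 11
# are the same as those of the standard genetic code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGCGTATAC TACATATTTT GAATCACGTC"
print(translate(first_blocks))  # MRILHILNHV
```

Applying `translate` and `gc_content` to the full gene sequence would be the way to compare against the Protein Length and GC content fields of the record.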