Gene Ava_3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3572 
Symbol 
ID3679518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4450065 
End bp4451234 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content37% 
IMG OID637718923 
Productglycosyl transferase, group 1 
Protein accessionYP_324073 
Protein GI75909777 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.483254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0128829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAATATA AAAAAGAAAA CTTCAATTTA CTACCTGCTT CTATTCTGAC CTTAGGTTCT 
GGTTGGTTTC CTACAAATCC TGGGGGTTTG GAAAGATATA TTTATGAGTT GACTTATCAA
TTAGCAGCTA ATCAAGATAG AGTAGAGTTA TGTGGAGTTG GCTTACCAAC TAATGAATTT
CATTTGCCAA TTAAATTAAC TAATTTAGCA TCTCCAGATA GTAAAATTTG GCAAAGATTT
TGGTCTATTC GGACTAATTT CCAGAAAACA AGAATAAGCA AACCGGACGC AATCAATTTA
CATTTTGCTT TATATAGTTT TCCTATTTTA GATATTTTGC CTCAGGGTGT ACCCATTACT
TTTAATTTTC ATGGGCCTTG GGCATCAGAA AGTAAACAGG AATTAGTAAA GAATAAAATC
AGTATTTTTT TAAAGCGTCG GCTGATAGAA CAAACCACAT ATAATCGTTG CGATCGCTTT
ATTGTTCTGA GTAAAGCATT CGGCAATATA TTACACCAAC AATATCAAAT TCCTTGGCAA
AAAATACATA TTATTCCTGG TGGTGTGAAC ATTGATAAAT TTCAGCCAAA TTTATCGCGT
CAACAAGCTC GCCAGCAGCT AAATTGGCCT GAAAGTCGTC CTATTTTATT TACATCCAGA
CGTTTAGTTC ACCGTGTGGG AGTAGACAAA CTATTACAAG CATTAGCCAT CATTAAACCA
AGAGTACCCG ATATTTGGCT AGCGATCGCC GGTCGGGGAC ATCTGCAAGG GACATTGGCA
AAACAAGCTC AAGAGTTGGG TTTAGAGAAC AACGTAAAGT TTTTAGGTTT TCTCCCAGAT
GAGCAGTTAC CTATCGCTTA CCAAGCTGCT AATTTAACTG TTATGCCCAG TCAATCTTTT
GAAGGTTTTG GGTTAGCAAT TACTGAATCT TTGGCTTGTG GTACTCCTGT TTTATGCACT
CCTATTGGAG GTATGCCAGA AATTTTAACT CCATTTTCAC CAGAATTAAT TACTACATCT
GCGGAAGCTA CTGCTATTGC GGAGAAAATA GTACATATAT TGCTAGAACA AATACCAACA
CCTTCACGAG AAGAATGTCG CCAATATGCT GTAACTAACT TTGATTGGCA GAAAATTGCT
CAACAAGTAC GGCGAGTTAT TTTAGCTTAA
 
Protein sequence
MEYKKENFNL LPASILTLGS GWFPTNPGGL ERYIYELTYQ LAANQDRVEL CGVGLPTNEF 
HLPIKLTNLA SPDSKIWQRF WSIRTNFQKT RISKPDAINL HFALYSFPIL DILPQGVPIT
FNFHGPWASE SKQELVKNKI SIFLKRRLIE QTTYNRCDRF IVLSKAFGNI LHQQYQIPWQ
KIHIIPGGVN IDKFQPNLSR QQARQQLNWP ESRPILFTSR RLVHRVGVDK LLQALAIIKP
RVPDIWLAIA GRGHLQGTLA KQAQELGLEN NVKFLGFLPD EQLPIAYQAA NLTVMPSQSF
EGFGLAITES LACGTPVLCT PIGGMPEILT PFSPELITTS AEATAIAEKI VHILLEQIPT
PSREECRQYA VTNFDWQKIA QQVRRVILA