Gene Ava_1041 details


Gene Information

Locus tag: Ava_1041
Symbol:
ID: 3678593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1265694
End bp: 1266956
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 35%
IMG OID: 637716377
Product: glycosyl transferase, group 1
Protein accession: YP_321560
Protein GI: 75907264
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
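
The length fields above are consistent with the coordinates: 1266956 - 1265694 + 1 = 1263 bp, and 1263 / 3 = 421 codons, one of which is the stop codon, leaving 420 aa. A minimal sketch of that check follows. It assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(1265694, 1266956)
print(length, protein_length(length))
```

For the values in this record the sketch gives 1263 bp and 420 aa, matching the Gene Length and Protein Length fields.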


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000238759
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0144782
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGATTT TACATCTTAG CACTTACGAT AATCGCGGTG GAGCAGCAAT AGCTACCTAT 
CGATTACACG ATGGGCTACA AAATATTGGC ATAACTTCCC AGATGCTTGT ACAAATTAAA
TTTAGTGACG ATAAATCTGT TATTGCAACA GGCAACAAAA TAGTTCATAA ATATCCCAAA
CTTAAACCCC ATTTAGATTC GTTACCGAAA CTATTTTTTA GACATATCGA TAAGAGTCGA
AGAACTTCAT ACTCTTTGCA ATGGCTTCCA GATTCTATAG CGACTAGCAT TATAAAAATA
GATACTGATA TCATTCATCT TCATTGGATA TCAGGAGGAC TTATAAATAT AGAAACAATA
GCTAAATTAA ATAAACCGCT TGTTTGGACT CTACATGATA TGTGGGCTTT TACTGGTGGA
TGTCATTATA ATCAAGAGTG TCAGCTTTAC AAAGAAAACT GTGGCAACTG CCCACAAATG
CCAAAAAGAT TCAATATAGA TTTATCTAGT TGGGTATGGG AACGCAAAGC TAAGGCTTGG
GCTGATTTAA ACTGTACAGT TGTAACTCCA AGTCATTGGT TAGCTAAATG CGCTGCATCC
AGTTCTCTAT TTAAAGATTG CAATATCAAA GTAATACCCT ATGGTTTAGA TACAGAAGTT
TACAAGCCTT ATCAAAAAAA CTTGGTAAGA GATAAATTCA ATCTACCTCA AGACAAACTC
TTAATCCTCT TTGGTGCTGA AAATGCTGCT AGTAATACAC GTAAAGGGTT TCACTTTTTA
AGATGTGCAC TAGAAATATT AAAACATACT TACTGGCATG ATAAGTGTGA GCTTGTTATA
TTCGGTGCAA GTAAGTCAGA TTCTATAAGT AACTTGGGTT TTAATACTCA CTATCTTGGC
CGCTTAAATA ATGAATCTAC AGTAGCGCAA GTTTATTCAG CAGCAGATGT TTTTGTTGCT
CCTTCGATAC AAGATAACTT GCCTAATACG GTTATGGAGT CACTTGCTTG TGGTACGCCC
TGTGTTGCTT TTGATATTGG GGGAATGCCT GACATGATTA ATCATAAACA GAACGGCTAT
TTAAGCCAGC CTTACAATAT TGATGACTTG GCAAATGGAA TTATTTGGGT AATAGAAGAT
AAAGAGCGAC ATCAAAAGCT TTGTGCTAGT TCTTGTGCAA CAGTCAAGGA AAAATTTACA
CTAGAATTAC AAGCGAAAAA TTACTTGTCT TTATATCAAA ATATATTAAA AATAAATAAT
TAA
 
Protein sequence
MKILHLSTYD NRGGAAIATY RLHDGLQNIG ITSQMLVQIK FSDDKSVIAT GNKIVHKYPK 
LKPHLDSLPK LFFRHIDKSR RTSYSLQWLP DSIATSIIKI DTDIIHLHWI SGGLINIETI
AKLNKPLVWT LHDMWAFTGG CHYNQECQLY KENCGNCPQM PKRFNIDLSS WVWERKAKAW
ADLNCTVVTP SHWLAKCAAS SSLFKDCNIK VIPYGLDTEV YKPYQKNLVR DKFNLPQDKL
LILFGAENAA SNTRKGFHFL RCALEILKHT YWHDKCELVI FGASKSDSIS NLGFNTHYLG
RLNNESTVAQ VYSAADVFVA PSIQDNLPNT VMESLACGTP CVAFDIGGMP DMINHKQNGY
LSQPYNIDDL ANGIIWVIED KERHQKLCAS SCATVKEKFT LELQAKNYLS LYQNILKINN
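
The gene sequence begins with TTG, while the protein sequence begins with M. This is expected for translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below shows how the protein could be derived from the gene sequence as listed. It is not the database's own pipeline; the helper names are hypothetical, and only the three most common table 11 initiation codons (ATG, GTG, TTG) are handled.

```python
from itertools import product

# Codon-to-residue mapping in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in its initiation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common alternative initiation codons are covered.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate("TTGAAGATTT TACATCTTAG CACTTACGAT"))  # MKILHLSTYD
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be cross-checked in the same way.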