Gene Ava_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1038 
Symbol 
ID3678706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1262463 
End bp1263530 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content35% 
IMG OID637716374 
Productglycosyl transferase family protein 
Protein accessionYP_321557 
Protein GI75907261 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.378242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.03094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATTG CAAATATAGA ATATCCAAGC TTAAGTTTAG CAATTCCAGC TTATAACGAA 
GCTGGGAATA TAGAACATTT GATTAGGGGA TTTTTGACAA CTGAATATCC AAACTTAATA
GAAGTAATTG TGGCTGATGG TGGTAGCACT GATGGGACAC AGGATATTGT CAAAAAATTG
TCATTAGAAG ATTCTAGAGT CAAGCTGTTA TACAATTCGT TGAAAATTCA ATCGGCTGGT
CTAAATCTTA TATTGCAAGA ATGCACTGGT GATATATTTC TTAGGGCTGA TGCTCACTCT
GATTATGCAC CAGATTATAT AGAAAGATGT GTGGAAGCAT TATTAGAATC TCAAGCCTTT
AACGTGGGGG GCGCACAAAG ATTTGTGGCT AAAACTCCCT TTCAGGCTGG GGTAGCACTG
GCTTCTAAAA GCTTTTTAGG TAGTGGTGGA GCTAAATATA GAAACCCTAA CTATAATGGA
TATGCTGATA CAGTTTATTT AGGATGTTTT TGGACGAAAG AATTGCGTGG CGCTTCTGGT
TTTGATATTT CACAAATTAC TAACCAAGAT GCTGAATTAA ATCAAAAATT ACTGAATAAA
AACCCAAAAG CTATATATAT AAGTTCAGAT ATTTGTGTAT GGTACTATCC CAGAAAAACC
TGGAAATCTA TTTGTATTCA ATACTTCAAA TACGGAAGAG GACGTTACTT AACTAGTATT
AAACACACAA AGCAACTGCA ACTGAGAGGA AGGCTACCAT TTTTATTTAT ATCAGCTACA
TTGTTTTTAT CGCTAATTGA TTTCATCATT CCTCAGTTAT CTCTACATAC AGAAGTATTA
ATTCTGAGTT GCTTACTTTT TCCATTTGGG GAAAGTTTAC GCACAATTTT CAAATTCCGT
AACGAATTTA CTAAAGAACT CTGGCGCGGT AGCGAAGATG AAATTCCTTC CTGTATAAGT
CTGTGGTTTT TCTGTGGAGT TACATTACTA ACTATGCCAA TTGCTCACTT TTCTGGCTAT
GGATATCAGC TATTTAGGCG TAGATTTCTA AAAGTTACAG GTTGGTAA
 
Protein sequence
MSIANIEYPS LSLAIPAYNE AGNIEHLIRG FLTTEYPNLI EVIVADGGST DGTQDIVKKL 
SLEDSRVKLL YNSLKIQSAG LNLILQECTG DIFLRADAHS DYAPDYIERC VEALLESQAF
NVGGAQRFVA KTPFQAGVAL ASKSFLGSGG AKYRNPNYNG YADTVYLGCF WTKELRGASG
FDISQITNQD AELNQKLLNK NPKAIYISSD ICVWYYPRKT WKSICIQYFK YGRGRYLTSI
KHTKQLQLRG RLPFLFISAT LFLSLIDFII PQLSLHTEVL ILSCLLFPFG ESLRTIFKFR
NEFTKELWRG SEDEIPSCIS LWFFCGVTLL TMPIAHFSGY GYQLFRRRFL KVTGW