Gene Ava_3121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3121 
Symbol 
ID3680867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3876674 
End bp3878293 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content43% 
IMG OID637718465 
Productglycosyl transferase family protein 
Protein accessionYP_323624 
Protein GI75909328 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAG GAAGCTTTAT TTGGGGTCAT CTGGAAAGAC AGCACCGTGC AGTTGATAAA 
TGGATTGATC GCTTATGGCT GGTAGTGTTG TTAATTGCAG CATTACTATT ATTCAGCGTT
AACTTAGGAG GATTACCCCT GCAAGATTGG GACGAGGGAA CTATAGCCCA AATTGCTAGG
GAAATAACAC AAGAATCGGC CAATTCCATA CGTTGGCTAT ATCCCACCCT CAGAGGGCAA
CCTTATCACG ATACACCGCC TCTCATGCAC ATAGTGATTG CAGGGGCTTA CCATTTAGGA
GGTGTGAATG AATGGACGAC ACGCTTACCT GGGGCAATTT TAACTGCCTT ATCAGTACCA
TTACTATATT GTCTTGGTAG AGAGATATTT CGTCAACGCT GGGCAGCGAT TTATAGTGCC
TTAATCTACT TAACAATGTT GCCAGTAGTC CGTTATGGGC GCTTGGCAAT GTTAGAAGGG
GCTGTGGTGA GTTTCTTGTT GGTGATGATG CTGTGTGTGT TGCGATCGCG TCGAGATTTA
CGTTACTGTC TCGGCATAGG TTTAAGTTTA GGATTAATTT GTCTCACCCA AGGACTATCC
GGCTTTTTAT TAGGTGCAGT CGTCCTTGTG TTTTTGTTTT GGGATACACC AAGACTACTT
ACCAGTTATT ATCTATGGAT AGCGATCGCC ATTGGAATTT TGCCTGTAGC TGGTTGGTAT
AGCGCTCAAC TACTACACTA TGGCAATGAT TTTGTCCAGA ATGGCTTACT TCTTCAATCC
CTACAACAAG CGGGTATAGT TAACCACAAG AATGCTCAAC CATCTTGGTT TTACATAGTT
GAACTCCTCA AGTGGACTTG GCCTTGGTTA ATCTTCTTAC CACAAACAAC CAAGTTACTT
TGGGAAAATC GCAATCTTAG CTGGGCAAGA CTCATACTTG TATGGAGCAT AGTTTATTTG
CTACTCATTT CTCTCATGAG CGTGAAACTT GCTTGGTACA TATTCCCCAT TTACCCAAGC
CTAGCCTTAG CTTTTGGCGC ACAGTTAGCA GAAATAGAAA ACTTACCTTT ATTATCATCC
TATCCCCGCG CTTGGGTCGC TGGTTTGTCA ATCTTAGCCG TGGTAGCTTC AGCTGCTAGC
ATTCACTTTA GTTGGGGAAT AGCTGCCAAA ACTGACTTAC AGCTAATTTT TGCCGCAGTC
GCTTTGACGA TGATTATGGG AGCGATTTTA GCAGAACGAG GCGACGGACA ATTTCTCAAA
ATATTGTTTT GGGGTAGTTA TATTTCCCTG CTGCTGTTGA TGAAATCTAA CTACTGGGTT
TGGGAATTAT GGCAAGCTTA CCCAGTAAAA CCTGTAGCGG CAATGATAGA GAAAGTAAAT
CCAGCTGCAA AAAAGATTTA TACATCTTTT CCCTATCATC GCCCATCATT GGATTTTTAT
AGCGATCGCC ATATTATTCC CGCCTCTGCC GAAGAACTAA AACATCATTG GCACTACGAC
AGACAACCAT ATCTACTCAT CAGTTCATCA GATTTCCAAC GCCTGCAATT AGACTCCGTT
CAGATACTCG ATAAAACTGA AGGCTGGCAA TTAATTACCA AAGAAACCAA TCGATTGTAA
 
Protein sequence
MQEGSFIWGH LERQHRAVDK WIDRLWLVVL LIAALLLFSV NLGGLPLQDW DEGTIAQIAR 
EITQESANSI RWLYPTLRGQ PYHDTPPLMH IVIAGAYHLG GVNEWTTRLP GAILTALSVP
LLYCLGREIF RQRWAAIYSA LIYLTMLPVV RYGRLAMLEG AVVSFLLVMM LCVLRSRRDL
RYCLGIGLSL GLICLTQGLS GFLLGAVVLV FLFWDTPRLL TSYYLWIAIA IGILPVAGWY
SAQLLHYGND FVQNGLLLQS LQQAGIVNHK NAQPSWFYIV ELLKWTWPWL IFLPQTTKLL
WENRNLSWAR LILVWSIVYL LLISLMSVKL AWYIFPIYPS LALAFGAQLA EIENLPLLSS
YPRAWVAGLS ILAVVASAAS IHFSWGIAAK TDLQLIFAAV ALTMIMGAIL AERGDGQFLK
ILFWGSYISL LLLMKSNYWV WELWQAYPVK PVAAMIEKVN PAAKKIYTSF PYHRPSLDFY
SDRHIIPASA EELKHHWHYD RQPYLLISSS DFQRLQLDSV QILDKTEGWQ LITKETNRL