Gene Ava_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1034 
Symbol 
ID3678702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1257449 
End bp1259344 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content43% 
IMG OID637716370 
Productglycosyl transferase family protein 
Protein accessionYP_321553 
Protein GI75907257 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.370681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTAA AATTGAGCGT TCTTGGCGCT GTTGAGCGGT GCTTCAATAA TCTAGTAAAG 
CGTCCAGCCC TTGCTGTGAC TGCCTCAATT CTGTGGTTGA TTCTAATTGG TTGGATAGGC
TATGGATGGA ACTTAGGCAG TGTTGGCTTG GTTGACGAAA CAGAGCCATT ATTTGCTGAA
GCTTCACGGC AAATGTTAGT TACGGGTGAT TGGATCACGC CGTTTTTCAA TGGTCAAACT
CGTTTTGACA AACCTGCTTT AGTTTATTGG TGTCAAGCGA TCGCCTATGC GGTGTTTGGT
GTGAATGAGT GGGCAGTGCG TCTTCCCTCA GCGTTAGCAG CAATGGGAGC AGTGTCCCTA
GCTTTTTATA CAGTTCATTG GTCTATAACG AAAAAGGATG AGTTAGAACA AGTGACACTG
CCAACTCGCC GCTACTTAAC AGCTGGTGTA GCCGCAGGTG TCATGGCACT CAATGCACAA
ATGATTGTCT GGGGAAGAAC TGGTGTCTCA GATATGCTCT TAACTGGGTG TATCGCCTCA
GCTTTGTTAT GCTTTTTCCT CGGATACGCG GCAATGGAGT CTGGAGAAAG ACAGGAAGCA
GGAGATGGGG GAATGCCGAA CCCAAAGGGC AGGAGGAATA AGCGATCGCT ATTCCCTAAC
AAGTGGTATT TGGCTTGTTA TGTGCTAACT GCTGGGGCAA TTTTGACTAA AGGCCCTGTG
GGTATTGTTT TACCGGGGAT CATTGTCTTG GTATTTTTGC TATATGTGGG GCAGTTGCGA
ACAGTACTGC GGGAAATGCG CCTCGTTTTA GGAACAGTTA TTATCTTAGG TTTATCTGTT
CCCTGGTATG CCTTGGTGAT TTGGCGTAAT GGTGAGAGTT ATATCAACTC TTTTTTTGGA
TATCACAATG TGGAGCGTTT TACGGAAGTA GTTAATGGTC ACTCGGCTCC TTGGTATTTT
TACTTTGTGA TAGTGACACT ATTTTTCGCC CCATATTCGG TCTATTTACC TTTAGCACTT
TTCAGACTGA AGTTTTGGCA GCGATCGCAC TGGCAAAATC AAGAACGTTC TCAACAGTTG
GGTTTATTTG CCTGTATTTG GTTTCTGAGC GTTTTTAGTT TCTTTACGAT CGCTGTTACC
AAACTGCCAA GCTACGTCTT ACCTTTAATG CCGGCGGCCG CTATTTTAGT AGCATTGTTA
TGGAGTGATT TCTTCCCCAG TGGTGAACAA ACAAACAAGA TAGAGATTAC TTATCCATCT
TCTCTTTTAC TAGCCAGTGG CTGGGTAAAT GTTATATTTT TAACCATTGT GGCAGTAGCG
TCGTTTCACA CATATCATCT GTTGGGTAAT GATGATGCAG CCCCCAACTT CCGCCAAAAT
TTACAAGATT CTGGATTACC GGCGATCGGC GGCTGGCTTT GGCTCGCCGG GGCAATTTTT
GTTGCTGTTT TAATATTGCG TCGCTATTGG CATTCTATTA TCGGCGTTAA TATGCTCGGA
TTCGCGGCCT TTTTGCTAGT TGTCACCATG CCAGCTTTGT TTTTGATGGA TCAAGAGCGT
CAACTACCAT TAAGAGACTT ATCTGCTGTT GTAGCTCAAG TACAACAACC AAAAGAAGAA
ATAATGATGG TTGGTTTCAA AAAGCCAAGT GTAGTTTTTT ATAGTCACAA ACAGATAAAT
TTTGTCCAGA CAACAGAAGA GGGTGTGGAA TATATCCATA ATTTAGCTAA TCAAGCAGTT
AAACCATCTT CCCTATTACT TGTGACTAAC AAAAAAAACT TTTTCAAAAT GGACTTACCG
CCAGATAATT ACGAAAATTT AGAAATTCAA GGTGCTTACC AATTGACTCG GATTAATTTC
AAGAAGATGA AAACTGAAAA AGTTAAAATT TCCTGA
 
Protein sequence
MRLKLSVLGA VERCFNNLVK RPALAVTASI LWLILIGWIG YGWNLGSVGL VDETEPLFAE 
ASRQMLVTGD WITPFFNGQT RFDKPALVYW CQAIAYAVFG VNEWAVRLPS ALAAMGAVSL
AFYTVHWSIT KKDELEQVTL PTRRYLTAGV AAGVMALNAQ MIVWGRTGVS DMLLTGCIAS
ALLCFFLGYA AMESGERQEA GDGGMPNPKG RRNKRSLFPN KWYLACYVLT AGAILTKGPV
GIVLPGIIVL VFLLYVGQLR TVLREMRLVL GTVIILGLSV PWYALVIWRN GESYINSFFG
YHNVERFTEV VNGHSAPWYF YFVIVTLFFA PYSVYLPLAL FRLKFWQRSH WQNQERSQQL
GLFACIWFLS VFSFFTIAVT KLPSYVLPLM PAAAILVALL WSDFFPSGEQ TNKIEITYPS
SLLLASGWVN VIFLTIVAVA SFHTYHLLGN DDAAPNFRQN LQDSGLPAIG GWLWLAGAIF
VAVLILRRYW HSIIGVNMLG FAAFLLVVTM PALFLMDQER QLPLRDLSAV VAQVQQPKEE
IMMVGFKKPS VVFYSHKQIN FVQTTEEGVE YIHNLANQAV KPSSLLLVTN KKNFFKMDLP
PDNYENLEIQ GAYQLTRINF KKMKTEKVKI S