Gene Ava_4841 details


Gene Information

Locus tag: Ava_4841
Symbol:
ID: 3679339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6093465
End bp: 6094589
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 41%
IMG OID: 637720198
Product: glycosyl transferase, group 1
Protein accession: YP_325333
Protein GI: 75911037
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
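
The start, end, gene length, and protein length fields above are arithmetically related. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(6093465, 6094589)
print(length)                              # 1125
print(expected_protein_length_aa(length))  # 374
```

Both values agree with the Gene Length and Protein Length fields of this record.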


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000142373
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0561788
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAAT ATCGAATTAA AGTCGTTTTA TTACATTTTT CTTTTAGTGA ATATACTGTT 
CAATTGGCAA ATAATTTAGT TAAATATGTT GATTTAACAC TGATACATTC AGAAGAAATA
TATAGGCAAT GCAAAGATGT TCTTAATCCT CATATTCGAG TGATTCAAAT TAAGAAACCC
CGCATTCGTG ATCCTCGCAA CATTAAGGTG ATAGCAGCAA TGATGCGAAT GATTCGAGAG
ATAAACCCTG ATGTACTCCA TGTTCAAGAA ACTAACGATC CTTGGTATGA TTTAACTCTT
TTATTCAGTA AAATGCCCCC TCTGGTAACT ACAATTCATG ATGTATATCG TCACCCAGGC
GATCGCGATT TAACACCGGG CGCTGAATAT ACTCGCAGAA TAGCTTTCTA CCGTTCCCAG
CAATTAATTG TCCACTCCCA GTCACTTCAA GATATCCTCA TCAAACAGTT CCGCTTACCT
CAACAGCGAA TTAACGTCCT ACCTCACGGA GAGTTAGGTA GTTTGTTTCA AAGTTGGTCA
AGTGGTCAAA TAGCACCTCG TGAGCCTCAT ACATTACTAT TTTTCGGGCG TATCTGGCCA
TACAAGGGTC TAAAATACTT GCTGCAAGCT ATCCCCTTAG TTGCAGAACA CATCCCCGGA
GTCAAACTTA TCATTGCTGG ACGGGGAGAA AATGTCAGTG AATTATTGCA GGATGCAGAC
AAAAAACACT ACGAAATTCT CAATAACTTT ATCCCTACCG GAAATGTCGC CAATTTATTT
CAACGAAGTG CGGCTGTTGT TCTACCTTAC ATTGAATCTT CACAAAGCGG TGTAGCAGCG
ATCGCCTATG CGATGGGTAC TCCTGTAATT GCTTCTAATA TTGGCGGTTT GAGGGAAATA
GTCCGACATG AACAAGACGG ACTGCTAGTA CCACCGTGTG ATGTCCAGTC TCTTGCAGAT
GCAATCATTC GGCTATTAAG TGACTCTCAC TTACAACGTC AGATGCAAAT CGCCGCATTA
GAGCGTTGTC AACAAGACTT GAACTGGTCA AACATTGCAG CTCAAACAAT CGAAGTTTAC
CATCAAGCGA TCGCCGCCAA AAGTACATCT TTGATGACCA GATGA
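
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be cleaned and its GC fraction computed is given below. The helper names are hypothetical, and the exact method used to derive the GC content field of this record is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used only as a small demonstration.
sample = "ATGGGTAAAT ATCGAATTAA"
print(len(clean_sequence(sample)))  # 20
print(gc_content(sample))           # 0.25
```

Passing the full listing to `clean_sequence` should give a string whose length matches the Gene Length field.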
 
Protein sequence
MGKYRIKVVL LHFSFSEYTV QLANNLVKYV DLTLIHSEEI YRQCKDVLNP HIRVIQIKKP 
RIRDPRNIKV IAAMMRMIRE INPDVLHVQE TNDPWYDLTL LFSKMPPLVT TIHDVYRHPG
DRDLTPGAEY TRRIAFYRSQ QLIVHSQSLQ DILIKQFRLP QQRINVLPHG ELGSLFQSWS
SGQIAPREPH TLLFFGRIWP YKGLKYLLQA IPLVAEHIPG VKLIIAGRGE NVSELLQDAD
KKHYEILNNF IPTGNVANLF QRSAAVVLPY IESSQSGVAA IAYAMGTPVI ASNIGGLREI
VRHEQDGLLV PPCDVQSLAD AIIRLLSDSH LQRQMQIAAL ERCQQDLNWS NIAAQTIEVY
HQAIAAKSTS LMTR