Gene Ava_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1045 
Symbol 
ID3678597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1271148 
End bp1273358 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content42% 
IMG OID637716381 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_321564 
Protein GI75907268 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0489] ATPases involved in chromosome partitioning
[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0728146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.010052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCCC ATCAAGACGA TCAAGACCTG ATTAACTTCC AACAGTATTG GCTGATTGCA 
AAAAGACGTT GGTTACTGAT AGCCACGATT ATGGGATCGG TGTTTGGAGT CACAGGGTTG
CTTACTTTTA GTCAAAAACC AATCTATGAA GCAGAGGGTA AACTCCTCTT TAATAAGCAA
AGTGGCGTTT CTTCCCTAAC AGGTGTAAGT GAGCAATTGG GACAACTTAG TGGGGTGACC
AATCTTTCTA ACCCTTTGGA AACTGAGGCA GAAATTATTC GCTCAAACCC CATAATTCAA
AAAACCATTG CTGAGTCGCA GCTTAAAGAT GAACAAGGAC AACCTTTAGA AATAGATGAT
TTCCTACGAA GCTTAAAAAT AAAAAGTGTC CGAGGAACTG ATATCTTACA ACTTACCTAT
CGTAGTGCTA ATCCTGAAGA AGCAACAGCG ATTATTAATA GATTGATGAA CACTTATCTG
GAGAACAATG TTCGCAATAA TCGTTCTGAA GCTACAGCCG CAAGGGAATT TTTAAGCAAG
CAATTACCTC TAGTCGAGGC GAAAGTCACA GAAGCAGAAG CAGCACTACG TCGATTCAAA
GAAAAATACG AGGTTGTTTC CCTACAAGAA GAAGCTATAC AAGGAGTCAA AAGGCTGAAT
GATTTATCCG GCCAAGTGAC TCAACTGCGC GCACAGTTAG TTGATGCTAG AACTCGTTCT
GGTGCTTTGC AAAATCAATT AGCATTGAAC ACAAAGCAAG CTATGGCACT GAGCAGCTTA
AGCCAATCCA ATGCAGTACA ACAAGTGCTA TCAGAATACC AAAAGGTTCA AGACCAGTTA
GCTGTGGAGA GGTCACGATT TACTGAGGAA CACCCGGTTA TTGCTAATTT ATTAAATAAA
GAACAAGCTC TCAAAGAGCA GCTAGAAGGA AGAGTGAGCA AAACTTTAGG TAGTTGGCAG
CCGATTCCAG AGCAAGATTT ACAAATAGGG GAACTCAAAC AGACTCTAAC CGCTAATTTA
GTGCAGGTGG AAGTTGAACG ATTGGGATTA GAAAATCAAG TTGGTGTGTT GATGAAGGCA
TTTGTACTTT ATCAAGCACG TCTGAGAGTC TTACCCAAGT TGGAACAACA ACAACTACAG
TTACAACGAC AGCTACAGAT TGCCCAAACA ACCTATGAAC AAATGCTAAA ACGATTGCAA
GATGTTGAGG TGGTGGAAAA TCAGAATGTG GGTAATGCCA GGATTGTTTC TGAAGCTTTA
CTTCCCAAAA CACCAGTTTC TCCTCGGATT GTCTTGAATT TGGCATTAGG GGGATTTTTA
GGCTTCTTCT TAGCTATTGG TGCAGCTTTG TTGCTAGAGG CTGGAGACAA GTCTGTGAGG
ACACAAGAAG AAGCTCAACA GTTGCTGGAT TATCCTTTGT TGGGTACAAT TCCAGCTTTT
GACCAAAAAG CTAGACTGGC TCGTGGTGAG AGTATCACCG AATTACCCGT ACTCAATAAC
CCTTACTCTT CAGTTAATGC AGCCTTTGAA ATGCTGCAAA TCAATTTGGG CTTCTCCTTT
TCTGATAAGA AACTGAAGGT AATTGTGGTT AGCAGTTGTG TGATGAATGA AGGCAAGTCT
TTTATTGCTG CTAATTTGGC GGTAGCGACT GCCCAAATGG GAAGACGAGT ATTATTGATT
GATGCAGATA TGCGTCGCCC TCGTCAACAT GAAATGTGGC AACAGCCTAA CTTGATGGGT
TTAAGCAATG TTTTAGTGGG TCAGGCTACC CTAGCCGAAG CTGCTAAGGA AGTGGTAATT
AATCTAGAAT TACTTACTTC TGGTACCATA CCGCCTAACC CTGCGGCTCT ACTAGACTCA
CAACGTATGA ATGGGTTACT CCAACAAGCT GCTGAAGATT ATGACTACGT AATTATTGAC
ACTCCACCTT TAAGTGTTTT GGCGGATGCT TCGATCATAG GCAAAATGGC AGATGGAATG
TTATTGGTTG CACGTCCTGG TGTGCTTAAT TCTGCTGCGG CTAAGACTAC AAAGACACTC
ATTGAGCATT CGCGTGTGTC AGTGTTGGGA ATGGTGGTGA ATTGTGTAGC TACTGATAGT
AATGACTACG GTTACTACTA CTCCCATAAA AATACTGGAG ATAATAATTC TGGCAAAAAG
GATAGGATTA AGTCAAATTT AAGTAAAATC ACTGGATTGA GGCTGCTTTA A
 
Protein sequence
MPSHQDDQDL INFQQYWLIA KRRWLLIATI MGSVFGVTGL LTFSQKPIYE AEGKLLFNKQ 
SGVSSLTGVS EQLGQLSGVT NLSNPLETEA EIIRSNPIIQ KTIAESQLKD EQGQPLEIDD
FLRSLKIKSV RGTDILQLTY RSANPEEATA IINRLMNTYL ENNVRNNRSE ATAAREFLSK
QLPLVEAKVT EAEAALRRFK EKYEVVSLQE EAIQGVKRLN DLSGQVTQLR AQLVDARTRS
GALQNQLALN TKQAMALSSL SQSNAVQQVL SEYQKVQDQL AVERSRFTEE HPVIANLLNK
EQALKEQLEG RVSKTLGSWQ PIPEQDLQIG ELKQTLTANL VQVEVERLGL ENQVGVLMKA
FVLYQARLRV LPKLEQQQLQ LQRQLQIAQT TYEQMLKRLQ DVEVVENQNV GNARIVSEAL
LPKTPVSPRI VLNLALGGFL GFFLAIGAAL LLEAGDKSVR TQEEAQQLLD YPLLGTIPAF
DQKARLARGE SITELPVLNN PYSSVNAAFE MLQINLGFSF SDKKLKVIVV SSCVMNEGKS
FIAANLAVAT AQMGRRVLLI DADMRRPRQH EMWQQPNLMG LSNVLVGQAT LAEAAKEVVI
NLELLTSGTI PPNPAALLDS QRMNGLLQQA AEDYDYVIID TPPLSVLADA SIIGKMADGM
LLVARPGVLN SAAAKTTKTL IEHSRVSVLG MVVNCVATDS NDYGYYYSHK NTGDNNSGKK
DRIKSNLSKI TGLRLL