Gene Ava_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1116 
Symbol 
ID3678531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1360588 
End bp1362813 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content42% 
IMG OID637716452 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_321635 
Protein GI75907339 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0704492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCAAA CTAATCTAAG TACAAATATA AACAGCGCTG ATTCAGAACC CAGTTACGGC 
AAAATCTTTT CAGTTTTTCT GCGTAGGTCC CCTTGGTTCT TAACGGCATT TTTAACAACA
ATTGCCCTAG CAGCCCTGAT GACTGCAAGA ACCAAGCCTA CTTACAAAAG CACAATGCAA
CTGTTAGTAG AACCTAACTA TCAAGGCAAA AGAGAGGGTG GTACTCCAGA GACCCAGTTT
CTCGAACCTG ATATACAAAT AGATACTGCT ACACAACTGA ACTTGATGCA AAGTTCAGGA
CTAATTCACA AAGCAGTTGA CAAATTGCGA TCAGAATATC CAAATATTAC TGTAGCTGAT
ATTAAAAATG CCTTAGTCTT AAATCAAGTT AGGACAAAAG AAGATAATGT CGCTACTAAA
ATTTTTCAAG TAGATTATAC TTCTGGCGAC CCTGAACAAA CTCAAAAAGT TCTCGGTGCT
ATCAGGCAAG TCTATGTGGA ATATAACAAA CAGCAGCAAG ACAACCGACT CAAAAAGGGT
CTGCAAGTTA TTAGAGAACA GTTGAGCAAA GCCAGTGAAG AAGTGAATGC AGCTGAAGCC
AACTTACAAA GATTCCGCAG AAACCAAAAT TTAATTAACC CAGAAGAACA AGCAAAAGCT
CTAGAAGTTG CTTTAAATAC TATTGAGCAA GAACGTCGTA CTACTCGTTC TACATACGAA
GAGGCTTTAG CAAAACAAAA ATCTTTAGAA GAGCAATTAA ATCGTTCTCC CAAAAATGCT
TTAGTTACTT CTCGTTTGAG TCAATCCGTC CGCTACCAAG GCTTACTCAA CGAAATTCAA
AAAACAGAAC TGTTACTAGC TCAAGAACGT TTACGCTTCA CAGATGAAAC CCCCAGTGTC
CAGAAGCTAA AAGAACAGTT GCAAAGCCAA AAAGAATTAT TGCAACAGGA AGTAGGTAGA
ACTCTAGGTG TTAAATCTGC TAATGCCTTC CGTGGAGGAG AACCGCTTTT AGAACAGGGA
CAGTTTGGGG AAATTGATCT GACTTTGGCT AGTCAATTAG TAGAAACCCA GACAACCATA
GTTTCATTGA CTGCCCGTGA TCAAACTCTG GCACAAAAAG AAAATCAACT ACGCCTAGAA
ATTAAACGTT TTCCGCCCCT GTTGGCTTAT TACAACCGTA TACAACCACA GTTGCAATTC
AGCCGTGAGA GACTAGAGCA GTTATTACGG GCAGAACAAC AACTACGGCA AGAACTGGCT
AAAGGGGGAT TTAACTGGGA AGTGGTGGAA GAACCCCAGT TAGGGATGAA ATTAGGCCCA
AATCTCCAAC AGAATTTGAT GTTGGGTGCT GTTGTGGGGT TAATGCTGGG AGGTATTGCT
GCCTTTATTC GAGAAGCTGC TGATGATTCT GTTCACACCA CAGCTGAGTT AGAAAAACAA
GTAGCTTTAC CTTTATTAGG AAGCACCCCG AAATTGCCAC CAGCAAAGGG TAGAGAATCA
GTCATCAAGT TGCCTTTTGG CAAGCCTGAG GTTTTAGGGC CTTGGACTGT GGAGGTGCTA
CAGTCTTCCA TCCGTTGGGA ATCGTTGGAT CTGATTTACA AAAATATCGA GCTGTTAAAC
TCGGTATCTA GCTTAAAGTC CTTGATGATT ACCTCACCTC TACTGGATAG AGGTAAATCT
GGTTTGGCCT TGGGTTTAGC TATGAGTGCA GCCCGGTTAC ACAAACGGGT GCTACTGATT
GATGCGAATT TGCGAGATCC TAGCCTCCAT GAACATCTCA ATCTTCCCAA CGACCAAGGG
CTATCAACTT TGTTAGCCAG CGAAATCACT TTACCCGAAC AGATTGGTAT TCATAACGTA
GGTTCTGCTT ACATCGATAT TTTGACAGCT GGTCCCTCGC CCAGTGACTC AGCAAATCTC
TTGAGTTCTC CACGGATGAA ACAGTTAATG GCAGCATTTG AGGAGAACTA CGATTTGGTG
ATTATTGATG CGCCACCAGT TATAGGTCTA GTAGATGCTT TGTTGACAGC ATCATCTTGT
AGAAGTGTGG TGATGGTAGC TAGTCTGGGA AAAGTGACTC GCAATCAAAT CGCTCAAGCT
ACAGCTATGT TGAGCAAGTT AAACCTGATT GGTGTTGTGG CTAATGGAGT TTCTAACTCT
GATAGCACCT ATGTACCTTA TGCGCGGGAA TACCGTTTTG CTTTGCAACA AGCGGTGGAA
AAGTAA
 
Protein sequence
MVQTNLSTNI NSADSEPSYG KIFSVFLRRS PWFLTAFLTT IALAALMTAR TKPTYKSTMQ 
LLVEPNYQGK REGGTPETQF LEPDIQIDTA TQLNLMQSSG LIHKAVDKLR SEYPNITVAD
IKNALVLNQV RTKEDNVATK IFQVDYTSGD PEQTQKVLGA IRQVYVEYNK QQQDNRLKKG
LQVIREQLSK ASEEVNAAEA NLQRFRRNQN LINPEEQAKA LEVALNTIEQ ERRTTRSTYE
EALAKQKSLE EQLNRSPKNA LVTSRLSQSV RYQGLLNEIQ KTELLLAQER LRFTDETPSV
QKLKEQLQSQ KELLQQEVGR TLGVKSANAF RGGEPLLEQG QFGEIDLTLA SQLVETQTTI
VSLTARDQTL AQKENQLRLE IKRFPPLLAY YNRIQPQLQF SRERLEQLLR AEQQLRQELA
KGGFNWEVVE EPQLGMKLGP NLQQNLMLGA VVGLMLGGIA AFIREAADDS VHTTAELEKQ
VALPLLGSTP KLPPAKGRES VIKLPFGKPE VLGPWTVEVL QSSIRWESLD LIYKNIELLN
SVSSLKSLMI TSPLLDRGKS GLALGLAMSA ARLHKRVLLI DANLRDPSLH EHLNLPNDQG
LSTLLASEIT LPEQIGIHNV GSAYIDILTA GPSPSDSANL LSSPRMKQLM AAFEENYDLV
IIDAPPVIGL VDALLTASSC RSVVMVASLG KVTRNQIAQA TAMLSKLNLI GVVANGVSNS
DSTYVPYARE YRFALQQAVE K