Gene Ava_C0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0044 
Symbol 
ID3678120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp65758 
End bp67584 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content43% 
IMG OID637715128 
Producthypothetical protein 
Protein accessionYP_320322 
Protein GI75812705 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.612223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.249044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTTC ACTTCACCGC ATATTTCTTC AATTTCCTCA TGCAATTAAC ACAGATTAAT 
ATGAATAATC ATCTGTTTGC AACTGCTAAA ACTAAGACAG TTAGAGAAGT AAAAGCAAAC
AATAAAAATA GCAATACTGA CTTTAGTAAA TATACAGATA AGCTCATGTC ACCTCAAGGT
TTGGCACTTG CTGGGGGAAT TGGCTTATTG TTGCTTCTAC AGCTTTTTAG TAATGGTAAA
AAAGGCAAGC TTGCTACTAG CTATTGGGGT GGAGCAAAGG AAACCGCTCA AGCTAAGAAA
AAAGCTCTCA AGCAAATCGT CGCTCCTAAA TGTGATAGTG CCAGTTTATA TATTGGAGTA
CATAGATACA AGGGTCAAAA ATCACCCCAG GGGAGTGGGG GAGTTCCAGT TTATGTACCC
GATGTCCAAC GCGGAACTGC TGTAATTGGC GCACCTGGTA GTGGTAAATC ATTCTCGGCT
ATCAACCCGA TGATTTACTC AGCAATTGAC CAAGATTTTG GTATCGTATT ATACGATTTT
AAGTATGCTA GTCAAGCCAA AATCGCCAGT TATGCCAAAT CCAAGGGGTA TGACGTACAT
ATCTTTGCAC CGGGATTTCC CGAATCAGAG GTATGTAACC CAATCGATTT CTTGCGTGAC
AGTAGCGATG CTGAAACCGC ACGGCAGTTA GCTACGGTAA TTAACAAAAA CTTCCGCCTG
TTAGGTAATG CGTCTGAGGA TGCCTTCTTT GGCCCCGCCG GCGACCAGTT GACCCAGGCT
ATTTTAATGC TAACCAAAGA GTTTGACGAT CGCGCTGATA TTATGACGGC AGCGGCAATA
CTTTCTAGCG AGAAGATGGT CGAACGCTTG ATGGCAGCCG AACTAAACCC CTGGATTAGG
ATAGCTTTTG GTCAATTATT TAGCTCGGCT GGCTCAGAGA AAACTGTAGC GGGTATCGCT
GGTAGTGCTT CATTGATGTT CACCCGGTTC ATGGCAAGGG GTACATTGGG ATGCTTTGTC
GGTAAAACAA CCTTGCCCTT GGAGGTAAAA CGCAAACAGA TGATTATCTT CGGGTTAGAC
CGCGAACGCC GCGATGCTGT CAGCCCTCTG ATGACCAGTA TTCTTCACAT GGTAGTTTCC
CGTAGTATTG CCAAAAAACG TAAGGAAGAC GGCCCATTAG TGGTGTGTCT TGACGAGTTG
CCAAGTATCT TCCTTCCAGA TTTATTTAGA TGGCTTAATG AATCTCGTTC CGAAGGATTC
TGCGGTATCC TGGGTTGGCA AAATATGGGT CAGCTTGAAA AGATTTATGG TAAAGAAATC
TCTAAAGCTA TCCTTGGTGC GTGCGGTACA AAATTTGTTT TTAATCCTGG TGAAGAAGAA
TCAGCACGAT TATTTTCTGC ATATTTAGGT GAAGAGGAAA TTAAATATAA ACAAAAATCC
CGCTCAACGG GAGGGGGTAA AGCCAGCACA TCTATCAGCG AACAAGAACG GACTCGCAAG
CTATTCGAGC CAGCCCAATT CCTTAAATTA CCACCTGGTA AATGTCTATT TATCAACCCA
GCTTATAGCA ATAAAAATGA GGGTTCAGTA CCACTATTAA AAAATATCAG AATCCCCAAA
TATATCATTG GATTAGAACA AGAAAATGAC GCTAACTGGG ATAAACTCAT CAAAGAGCTT
GCTAGGAAAA GTACCCAGAA AAAACCCACA CAGGAAGATT TAGATTTACG AGTCAAGGAA
GTAGATCGGA AGTTCCCTAT CCCACAAGCA CCCGTACAAG CGGGCAATGC ACCTTTACCT
GTTGATGCGT ACAAGAGCTT TTTCTAA
 
Protein sequence
MQLHFTAYFF NFLMQLTQIN MNNHLFATAK TKTVREVKAN NKNSNTDFSK YTDKLMSPQG 
LALAGGIGLL LLLQLFSNGK KGKLATSYWG GAKETAQAKK KALKQIVAPK CDSASLYIGV
HRYKGQKSPQ GSGGVPVYVP DVQRGTAVIG APGSGKSFSA INPMIYSAID QDFGIVLYDF
KYASQAKIAS YAKSKGYDVH IFAPGFPESE VCNPIDFLRD SSDAETARQL ATVINKNFRL
LGNASEDAFF GPAGDQLTQA ILMLTKEFDD RADIMTAAAI LSSEKMVERL MAAELNPWIR
IAFGQLFSSA GSEKTVAGIA GSASLMFTRF MARGTLGCFV GKTTLPLEVK RKQMIIFGLD
RERRDAVSPL MTSILHMVVS RSIAKKRKED GPLVVCLDEL PSIFLPDLFR WLNESRSEGF
CGILGWQNMG QLEKIYGKEI SKAILGACGT KFVFNPGEEE SARLFSAYLG EEEIKYKQKS
RSTGGGKAST SISEQERTRK LFEPAQFLKL PPGKCLFINP AYSNKNEGSV PLLKNIRIPK
YIIGLEQEND ANWDKLIKEL ARKSTQKKPT QEDLDLRVKE VDRKFPIPQA PVQAGNAPLP
VDAYKSFF