Gene Ava_C0035 details


Gene Information

Locus tag: Ava_C0035
Symbol:
ID: 3678111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007412
Strand:
Start bp: 53901
End bp: 56708
Gene Length: 2808 bp
Protein Length: 935 aa
Translation table: 11
GC content: 42%
IMG OID: 637715119
Product: hypothetical protein
Protein accession: YP_320313
Protein GI: 75812696
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3451] Type IV secretory pathway, VirB4 components
TIGRFAM ID:
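
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 53901
end_bp = 56708

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2808
print(protein_length_aa)  # 935
```

Both results agree with the Gene Length and Protein Length fields of the record.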


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.981613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCGA CCGCCTCCAC GAAAACAAAG GACAAAATTG GTAAAAAATC CCTCGATACC 
GAACAATTAG GTGTTGTCAA AAACCTAACA CCGTTCGAGG ATATTACTCA TTTAGCCGGG
ATTGCAAGTA TCTCACTTGC TGGTCGTAGA GATATCGGCG CACTCATCCT CCAGAAAAAA
GAAGATATTC AAATTAAGTT CTGCTTCGAC GTACAAGGAG TCCACCCGTC TTTACCAGAA
GAACAGATTC TCCCGATTTT TGAAAATATT GAGGGCGGAC TCAAAGAACT TCCCGAACGC
GAAACCTTAA CAATTCATCT TGGTTCGTTC ACTGATGATT TTCTCCGGCA GCAGGAACTA
AAAAAGATAG AGGATAACTG CGACCTAGAG CAACTAACTT TATTAGTGCG ATCGGAACGT
CTGCGGGCAA GAGAACTCAC TAAAGCAGGA ACTCGTAAAA ATAAGTTTTT ACGTTTCTGG
TGTACCTATA GTGTCCTGGC TTCAGAAGAC AGTCGCTCAA ATGATGCAAT TGAAAAAACT
ATCAAACAAT TGCAAAATGC CTGGTTCCAA TTTACTGGAC AAATTCACGG TGTTCGGCAA
GAACGAATTG AAACAATTTT ACGGGATAGT TTTACCTCTG GGTTTCAACG GTGGGAACAA
CTTTTGACAA ACTCGATGGG GCTTTCAGTA CGAGCAGTCA CGAGTGAGGA AGTATGGTCA
ATTCTTTGGA GCCAATTTAA CCGTAGCGAT GCACCCAAAG TTCCCAATCC CCTCATATTA
GATGAAGAAG GACTTCGAGA AGTTCAAACT AGCGATTTCC ACATTCGCCA CCTGATGCTG
GAGAATGAAA AATCTGTCCC GTTTCTCGAT CGCAGCTGGG TAAGACTGCA AGACAGGTAT
ATCGGCGTTC TTCAGTTTAG CGAAAAGCCC CAAGGCTGGG TAGATGAATA CGCCCAGCTT
CGATATCTGT GGGAAGTAAT ATCTAAAGAA AAAATCTCCG ACACTGAGAT TATCTGCCAG
CTATCTAAAG CCAACCAGGG ACTTGCCAAA ACTGCTCTGC AACGGATTAC CAAACAGTCA
ATTACCTCCA GTGCCATGTC TTCAGAGAAG GGGTCAATTG ACGTAAAAGC CAACATGAAC
ATCGAGGAGG CAGTTAAGGC ACAAGAAACT ATCCTCAGAG GCAGTCTCCC GATTCATACT
GCTGTTGTTT TTTGTGTCCA TCGCCACAGT CGTCAAAAAC TTGATGAAGC TTGTCAATAC
CTTTCTAGCT GCTTCTTGCG CCCTGCTGTA GTTGAGAGAG AAATCGAGTA TGCCTGGAAA
ACATGGTTAC AATGTGTACC TGTTGTTTGG GAAAACCTAC TTACAAGGCC ATTTAACCGT
CGCCTGCCTT ATTTCTCATC AGAAGTTCCC GGACTCATGC CGATAGTTAG AACAGCTACG
GGAGATAAAT CCGGGTTTGA GCTAATTGCT GAAGAGGGAG GAACCCCAGT ACATCTTGAT
TTGTACAAGC AGCACAAAAA TCTCGCCATT TTTGGTACTA CTCGTGCGGG TAAATCTGTC
TTAGTAGCAG GTCTATTAAC ACCTGCCTTG GCTCAAAACA TTCCCGTAAT TGCACTTGAT
TATCCCAAAC CCGATGGTAC TAGTACGTTT ACTGACTACA CCAAATTTAT GGGAGAGGAA
GGAGCGTATT TCGATATTAG TAAAGAGTCT AACAATCTAT TTGAATTACC TGATTTAAGG
GGGTATGAGC CTGAAGTTAT TAAAGAAAGG ATGACCGATT TCAAGGAATT CTTGAAATCC
ACATTAATGA TTATGGTACT AGGAACAAAT CCTGTAGGGG TAAGTCCGAC AACGGTATCA
AATATTGAAA GTATTATTAC AATCACAATT GAAACTTTTT ATAACGATGA AGATATCAAG
CTTCGGTACA AGCTAGCACT AGAAAATGGA TTAGGAACAC CTGAATGGGC AGATATTCCC
ACACTTAAAG ATTTCTATAA TTATTGTTCG CCTGGGTTTA TAAAACTCGA TTCCATCACC
AATAATAGTA AAGAAATTCT CGATGCCCTT GACCAAATCA GATTGCGATT AAAGTTTTGG
CTAAATTCAC GAGTTGGGCA GTCAATCGCC AATCCATCTA GCTTTAAAAC TGATGCTCGA
TTGCTTGTAT TCGCACTGCG ATCGCTTGGG AGTGAAGCCG ATGCCGCAGT CCTTGCCCTG
AGTGCCTATT CCGCAGCCTT ACGTAGAGCT TTATCATCTA AAGTATCAAT ATTTTTCTTA
GACGAAGCAC CGATTTTATT CAATTTTGAG AGCATAGCCG AGCTAATTGG TAGACTCTGC
GCTAACGGTG CAAAAGCAGG TATACGTGTA ATTTTATCTG CCCAAGAACC AGAAAGTATT
TTCCAAAGTA AGTCTGCTTC CAAGATATTT GCTAACATCA CAACCCGCCT GATCGGTCGG
ATTCAAACAT CGGCAGTTGA CCCATTTGTG AACAGGTTTA AATATCCTTA TGAAATTATC
TCGCGTAATA GTACCGAAGC ATTCTTTCCC AAACGCGAAA GTATTTATTC ACAGTGGTTA
CTTGATGACA ATGGCAAACT TACTTTCTGT AGATATTATC CTGCTTATTG CTTACTTGCA
GCAGTAGCAA ATAACCCGAA TGAGCAAGAA TTACGCACTC TATTTCTCAA TAAATATGCC
GATAATCCAA TGCTGGGTAT GGTGAAATTT TCAGAAAGCT ATATACAACT GCTTCGGGGA
GATGAGTTAA GTCAAGAAGC ACAACAATTA CTAAAAAATC AGCGATAG
 
Protein sequence
MSSTASTKTK DKIGKKSLDT EQLGVVKNLT PFEDITHLAG IASISLAGRR DIGALILQKK 
EDIQIKFCFD VQGVHPSLPE EQILPIFENI EGGLKELPER ETLTIHLGSF TDDFLRQQEL
KKIEDNCDLE QLTLLVRSER LRARELTKAG TRKNKFLRFW CTYSVLASED SRSNDAIEKT
IKQLQNAWFQ FTGQIHGVRQ ERIETILRDS FTSGFQRWEQ LLTNSMGLSV RAVTSEEVWS
ILWSQFNRSD APKVPNPLIL DEEGLREVQT SDFHIRHLML ENEKSVPFLD RSWVRLQDRY
IGVLQFSEKP QGWVDEYAQL RYLWEVISKE KISDTEIICQ LSKANQGLAK TALQRITKQS
ITSSAMSSEK GSIDVKANMN IEEAVKAQET ILRGSLPIHT AVVFCVHRHS RQKLDEACQY
LSSCFLRPAV VEREIEYAWK TWLQCVPVVW ENLLTRPFNR RLPYFSSEVP GLMPIVRTAT
GDKSGFELIA EEGGTPVHLD LYKQHKNLAI FGTTRAGKSV LVAGLLTPAL AQNIPVIALD
YPKPDGTSTF TDYTKFMGEE GAYFDISKES NNLFELPDLR GYEPEVIKER MTDFKEFLKS
TLMIMVLGTN PVGVSPTTVS NIESIITITI ETFYNDEDIK LRYKLALENG LGTPEWADIP
TLKDFYNYCS PGFIKLDSIT NNSKEILDAL DQIRLRLKFW LNSRVGQSIA NPSSFKTDAR
LLVFALRSLG SEADAAVLAL SAYSAALRRA LSSKVSIFFL DEAPILFNFE SIAELIGRLC
ANGAKAGIRV ILSAQEPESI FQSKSASKIF ANITTRLIGR IQTSAVDPFV NRFKYPYEII
SRNSTEAFFP KRESIYSQWL LDDNGKLTFC RYYPAYCLLA AVANNPNEQE LRTLFLNKYA
DNPMLGMVKF SESYIQLLRG DELSQEAQQL LKNQR
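
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are the same for
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGAGTTCGA CCGCCTCCAC GAAAACAAAG GACAAAATTG GTAAAAAATC CCTCGATACC"
print(translate(first_row))  # MSSTASTKTKDKIGKKSLDT
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.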