Gene Ava_C0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0023 
Symbol 
ID3677788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp40106 
End bp41812 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content44% 
IMG OID637715107 
Productmajor outer membrane protein 
Protein accessionYP_320301 
Protein GI75812684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.500646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0378832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATT ATTGGTTTGC AAATTTTCAT ATACTGGGTT GTTTGAGTGT TTTGAGCTTC 
GTCATCCTCA ATCATGCAAC TCCTTCCCTA GCCAAGACTG TTGAGTCTTT AACTTTTGAG
GAAAAACAAG CAACAGAGGA TGTCCCTACA CTTTCTCCTG TTGTTCCAAC TTCAATTACT
CAGCTAATTC ATCAAATTAT TCCTCAAGAA TCAGACGCAA CGCTACAAGG ACAAGTGACT
TCAGTAAATC AACTGGATGA TGTTCAACCG ACAGATTGGG CATTTCAGGT GCTTCAGTCT
TTGATCTCTC GATATAACAT TCTTACAGGC TATCCAGCCC AAACTTTTCG GGGCGATGTC
TCACGACAAG CCGCTTTGCA TCTAAGAGCA ATGACTCGCG ACGAATTTGC ACTAGCATTG
AACATAACAC TTAAACAAAT CAGCGAACAA ATTGCTCAAG GAACTGGTTC ACGAATATCT
CGTGATGATT TAGAGACACT ACAACGACTT CAAGAAGAGT TTTCTGGCGA ACTCAGCACA
CTACAAGAAC GTGTAGATGG ATTAGCAGCC CGAACTGCTA AGTTGGAAGC AAGTCAGTTC
TCGACAACCA CCACATTCTC AGGAATAGTT GTATTTGGAG TCACTGGCGG AGGCTTTAGC
GGCGATCGCA TTGTTGATGT CACAGGTAGA GAAATTGCGA CAAAAGATCC AAACCTTACA
TTCCTTTACC GAGCTACCCT AGACTTCACT ACAAGTTTTA ATGGAACAGA TGCGCTAGAA
CTCTGGCTGG AAATTGGCAG CAATGGGGCA GATGACAATG CAGCAGGATT GTTAGAACCC
AGTTTTGGCA GCGTTTTAGA CTATTCAGCC AAACCCCCTG TTGAAGAGTT TGGCGTGTCC
CGTCTGAATT ATACCTTTTC TCTATCTGAG GATTTGACGC TTTCCCTAGG CCCAGTCATC
AGTCTTACTG ACTATGTAGA CTTAAACCGC TATGCAAATG TCAGTTTTCT AGACTTCTCT
ACGCAAGCGT TGGTAAATAA CTATATTCTT TTCCCAGTTC AAGGGCTAGG AGCGGGTGCT
GCTATCAGGT GGAATCCGAA TGAAGGCGCA TTTACGGCAC GAGCTGCTTA TGTGGCGGCA
TCTGCGAGTC GGTCGAAAAT AGAGAGTTCA TCTCCAGTTC CTGGTATTTT CCCACTGGGA
TACATTCTTT ATCCCAACGG ACGAGGAGAA GGAGGGCTGT TTGGCGATCC TTATCAAGGA
ATCATTGAGT TAGAATACGC TCCTTCTAGA ATGTTTGCGC TACGCTTGCA ATATACGAGC
GGTAGTATTT TAGGAGGGAA CTTTGATGTC TTTGGAGCCA ACTTGGAATT GACGCTTTCA
GACCGTTTTG CTGTTTTTGG ACGCTACGGT TACGGTAGCT ACGCTGATAC TGCCTTTGGT
GATTTAAAGC CTAGCTATTG GATGGCAGGT GTAGCTTTTC TGGATCTATT CATTGAAAAT
GCTCTGGCAG GCATAGCTGT AGGTCAGCCG TTTATCGCAA GTGAAATAGG AGATTCAACA
CAAACGAATT TCGAGGCTTT CTACAATTTT CCAATTAATG ACAATATCCG TGTCACACCT
GTATTTCAAG TGATTACAAA TCCAGCTAAT CAAAGCGTCA ATGGCACAAT CCTTACAAGT
ACACTCCGCA CCGTCTTCTC GTTCTAA
 
Protein sequence
MQNYWFANFH ILGCLSVLSF VILNHATPSL AKTVESLTFE EKQATEDVPT LSPVVPTSIT 
QLIHQIIPQE SDATLQGQVT SVNQLDDVQP TDWAFQVLQS LISRYNILTG YPAQTFRGDV
SRQAALHLRA MTRDEFALAL NITLKQISEQ IAQGTGSRIS RDDLETLQRL QEEFSGELST
LQERVDGLAA RTAKLEASQF STTTTFSGIV VFGVTGGGFS GDRIVDVTGR EIATKDPNLT
FLYRATLDFT TSFNGTDALE LWLEIGSNGA DDNAAGLLEP SFGSVLDYSA KPPVEEFGVS
RLNYTFSLSE DLTLSLGPVI SLTDYVDLNR YANVSFLDFS TQALVNNYIL FPVQGLGAGA
AIRWNPNEGA FTARAAYVAA SASRSKIESS SPVPGIFPLG YILYPNGRGE GGLFGDPYQG
IIELEYAPSR MFALRLQYTS GSILGGNFDV FGANLELTLS DRFAVFGRYG YGSYADTAFG
DLKPSYWMAG VAFLDLFIEN ALAGIAVGQP FIASEIGDST QTNFEAFYNF PINDNIRVTP
VFQVITNPAN QSVNGTILTS TLRTVFSF