Gene Ava_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2294 
Symbol 
ID3678902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2851015 
End bp2852325 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content44% 
IMG OID637717639 
Producthypothetical protein 
Protein accessionYP_322807 
Protein GI75908511 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.284249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCCAG CTAAACGGGT TTATCTGCTG CTAATTTTGG GACTAGCGAT CGCGCCAATT 
TTATCATTGT TAATTGGTAT TCCTGCCAGT ATAGCGATCG CTCTATTATT CGATATCACC
GTCTTAGGAT TGATGATTGT CGATAGTCGG CAAGTACGTT CTCTGCGGGT AGAAGTTCAG
CGTCAATTAC CAGCACGTTT ATCTATCGGG CGTGATAACC CCGTAATATT GTCAATCACA
TCAGCAAAGA CTGAGGCTGT AGTTCAAATC CGCGATTATT ACCCGACTGG ATTTGGTGTC
TCTACACTGA CGGTAAATAC TACAATTCCT TCACAAGGGA AGGAAGAAAT CAAATATACT
GTCAACCCAA CACAGCGCGG GGAATTTTCT TGGGGAAATA TTCAGGTACG ACAGTTAGCG
CCTTGGGGTT TAGCTTGGGA TGATTGGCAG ATTCCCCAAA GTGTACAGGT GAAAGTTTAC
CCAGATTTGA TTGGATTGCG ATCGCTCTCT ATTCGTTTGA CACTACAATC ATCTGGTTCG
ATGCGCCAAT CACGACAGTT AGGGATTGGG ACAGATTTTG CCGAATTGCG GAACTATCGC
ACTGGTGACG ATTTACGCTT GATTGATTGG AAAGCTACCG CCCGTCGTGT GGGAGTGCCT
TTAGTGCGAG TACTGGAACC AGAACAGGAG CAAACCCTAA TTATATTATT GGATCGCGGT
CGGTTGATGA CTGCCAGAGT CAAAGGATTA CAACGATTTG ACTGGGGATT AAATGCAGCT
TTATCATTAG CTTTAGCCGG ATTACATCGT GGCGATCGCG TCGGCGTGGG TGTATTTGAC
CGTCTTATGC ACACATGGCT ACCACCACAA AGAGGTCAAC ATCATTTAAG TAAGCTAATT
GACCATCTGA CACCGATTCA ACCAGTATTA TTAGAGTCCG ATTATTTGGG CGCAGTAACA
AATGTTGTCA GACAACAAAC TCGTCGCGCA TTAGTAGTAG TAATTACTGA TTTAGTAGAT
GTCACCGCCT CTACTGAACT CCTCGCCGCA CTTACTCGAC TAGCACCACG CTATTTACCC
TTTTGCGTCA CCTTGCGAGA TCCACAGGTA GACAATTTAG CACATACTTT TACGGAAGAT
GTGAGCCAAA CTTACAACCG TGCAGTTGCT TTAGATTTAT TGGCGCAAAG ACAAGTTGCT
TTTGCTCAAT TAAAACAAAA AGGTGTCTTA GTACTTGATG CACCAGCCAA TCAAATTACC
GATCAGTTAG TTGATCGATA TTTGCAACTG AAAGCCCGTA ATCAACTGTA A
 
Protein sequence
MVPAKRVYLL LILGLAIAPI LSLLIGIPAS IAIALLFDIT VLGLMIVDSR QVRSLRVEVQ 
RQLPARLSIG RDNPVILSIT SAKTEAVVQI RDYYPTGFGV STLTVNTTIP SQGKEEIKYT
VNPTQRGEFS WGNIQVRQLA PWGLAWDDWQ IPQSVQVKVY PDLIGLRSLS IRLTLQSSGS
MRQSRQLGIG TDFAELRNYR TGDDLRLIDW KATARRVGVP LVRVLEPEQE QTLIILLDRG
RLMTARVKGL QRFDWGLNAA LSLALAGLHR GDRVGVGVFD RLMHTWLPPQ RGQHHLSKLI
DHLTPIQPVL LESDYLGAVT NVVRQQTRRA LVVVITDLVD VTASTELLAA LTRLAPRYLP
FCVTLRDPQV DNLAHTFTED VSQTYNRAVA LDLLAQRQVA FAQLKQKGVL VLDAPANQIT
DQLVDRYLQL KARNQL