Gene Aazo_2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2950 
Symbol 
ID9340754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3033604 
End bp3034884 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003721884 
Protein GI298491707 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTC AATTACTATC GGCACTCAAC GATACTAATG TTGACGCTAC TCAATTGAGT 
AACCAACGTC AACTGGCAAT TTCCATTTCT TCCATTGCTG ATGAGCTTGA CCCAAGTTTA
CCGTTGAATT TATGCCTGAT TCTGGATAAA AGCGGTTCTA TGCACGGCGA ACCCATTAAC
ACCGTAATTC AGGCTGTAGA ACAATTATTA GCTCAACTCC AGCCAGGCGA TCACATCTCA
ATTGTCGCGT TTGCAGGTAC TTCTGAGGTC ATTATCCCTA ACCAAATCGT CCAAGATGCT
GAGAGCATCA AATGCCAGTT GCACAAAAGA CTCAAAGCTG GTGGTGGCAC AATCATTGCC
GAAGGTTTAT CTTTGGGAAT TACTGAATTA CTCAAAGGGA CAAAAGGCGC TGTTTCCCAA
GCATTTTTGT TGACAGATGG ACATGGTGAC AGGGGGTTAA AAATTTGGAA GTGGGAGATG
GGCCCCAATG ACAAGAAACG TTGTTTGGAA CTAGCACAAA AAGCCACTAG AGTAAGCCTA
ACGCTCAACA CCTTCGGTTT TGGCAATGAC TGGAACCAGG ATTTGCTGGA AAAAATTGCT
GATGCTGGTG GTGGTACTCT GGCTTATATT GAGCGTCCAC AACAAGCCGT AGATCAATTT
AGTCGCTTGC TTAAGCGAAT TCAGTCTGTG GGCTTAACCA ATGCCCACTT ATTGCTGTCT
CTAGTCCCTA GTGTGCGTTT AGCAGAACTA AAACCCATTG CCCAAGTTGC CCCAGAAACC
ATTGAGTTAC CAGTGGAAAC AGAACCCAAT GGTAGCTTAA TTGTGCGTTT GGGAGACTTG
ATGAAAGATG TAGAACGGGT AGTTTTAGCG AATATTTATT TGGGACAGTT GCCAGAAGGG
GAACAAGTAA TTGGACATAT CCAAATACGC TATGATGACC CAGCTATTAA CCAAGAAGGT
TTACTTTCCC CACTCATACC AGTTTATGCC AATTTCACTA AAACTTACCA ACCACTACTT
GATTCACAAG TGATCAAATC AATTTTGGTA TTGGCAAAAT ATCGCCAAAC TCAAGTAGCA
GAAGCAAAAC TGGAACAGGG TGATCGCACT GGTGCTATCA CAATGCTACA AACAGCCGCT
AAGACTGCTT TACAAATCGG TGATATTGGT GCAGCAACAG TGCTGCAATC TTCCGCTACT
CGTTTGCAAG CCGGTGAAGA ACTCTCTGAA GCAGACCGCA AAAAAACCAG GATTGTCTCG
AAAACTATTG TGAGGGAGTG A
 
Protein sequence
MKVQLLSALN DTNVDATQLS NQRQLAISIS SIADELDPSL PLNLCLILDK SGSMHGEPIN 
TVIQAVEQLL AQLQPGDHIS IVAFAGTSEV IIPNQIVQDA ESIKCQLHKR LKAGGGTIIA
EGLSLGITEL LKGTKGAVSQ AFLLTDGHGD RGLKIWKWEM GPNDKKRCLE LAQKATRVSL
TLNTFGFGND WNQDLLEKIA DAGGGTLAYI ERPQQAVDQF SRLLKRIQSV GLTNAHLLLS
LVPSVRLAEL KPIAQVAPET IELPVETEPN GSLIVRLGDL MKDVERVVLA NIYLGQLPEG
EQVIGHIQIR YDDPAINQEG LLSPLIPVYA NFTKTYQPLL DSQVIKSILV LAKYRQTQVA
EAKLEQGDRT GAITMLQTAA KTALQIGDIG AATVLQSSAT RLQAGEELSE ADRKKTRIVS
KTIVRE