Gene Aazo_3301 details


Gene Information

Locus tag: Aazo_3301
Symbol:
ID: 9341105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3380967
End bp: 3382376
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: family 2 glycosyl transferase
Protein accession: YP_003722101
Protein GI: 298491924
COG category:
COG ID:
TIGRFAM ID:
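
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(3380967, 3382376))  # 1410
print(protein_length(1410))           # 469
```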


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.144563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGCGA ATTCCTGGCC GGAAAACGAC TCTGATAACG GAACCTCTGC TCCTCTTAAC 
TCCCTAGTGT CTGACCTATC AGCACAACAA GAGTTAGTGG AAGACGCGAA TGCTATGTCT
GTACTTTCAC ACCGATTTAA ACGACGTACA CCCAAAGCCG CCCTGGTCTT GACTATTGTC
TGGAGTGGGA CGATCGCTTT GCATTTAGTT TCCTGGGGTT ATATTTTCAT TCTCGGACTG
ACAACTATCC TTGGTATTCA CGCCTTGGGT ATTATTTTTG CTAGACCCCG CCACCATCAC
AAAGAAATAC ACGGAGATTT GCCTTCTGTA TCTTTGTTGG TGGCTGCAAA AAATGAAGAA
GCAGTTATTA CTAGATTAGT CAAGAGTCTT TGTAGTCTGG AATATGCCAA TGGGGAATAT
GAAGTCTGGA TTATTGACGA TAATAGTACG GACAATACGC CGCATTTATT GGCAGAACTG
AAGCAGGAAT ACAAGCACCT CAAGGTGTTC AGACGTTCTG CTGAGGATAG TGGTGGTAAG
TCAGGAGCTT TAAATCAAGT CCTACCAATG ACAAAGGGGG ATATTATTGT GGTATTTGAT
GCTGATGCCC AAGTTAACCC AGATTTACTA TTACAGGTAG TGCCTTTGTT CCAAAAAGAA
CAGGTGGGGG CGGTGCAGGT GCGAAAAGCG ATCACCAACG CTAAGGAGAA TTTTTGGACT
AAGGGACAAA TGGCAGAAAT GGCTGTTGAT ACTTGGTTTC AACAACAACG GACTACTATT
GGTGGTCTTG GTGAACTGCG GGGTAATGGT CAATTTGTCC GTCGTCAAGC TTTGGATGGC
TGTGGTGGCT GGAATGAGGA AACCATCACC GATGATTTGG ATTTGACAAT TCGCCTGAAT
CTGGATAAAT GGGATATTGA ATGTATGTTC TATCCCCCAG TGCAAGAAGA AGGAGTCACA
AATGTGATCG CTCTTTGGCA TCAACGTAAC CGTTGGGCTG AAGGTGGTTA TCAGCGTTAT
TTAGATTACT GGGATCTGAT CCTTCAAAAC CGGATGGGGA CGCGGAAAAC CTGGGATTTG
CTGATTTTCC TCCTGATTAT GTATATCCTA CCCACAGCAG CAATACCAGA TTTATTAATG
TCTCTAATTC GCCATCGTCC ACCAGTATTA ACCTCTGTAA CTGGTCTGTC AGTTACGATG
TCTTTTGTGG GGATGTTTGC TGGTTTAAGG CGGACACGCC AAGATCAGAA AACATCTAAC
TATTTTGTGT TACTTCTACA AACCATTCGC GGTAGTATTT ATATGTTGCA TTGGTTGGTA
GTTATGAGTA GCACTACCGC CCGGATGTCA GTACGTCCCA AACGTCTAAA ATGGGTGAAA
ACCGTGCATA CAGGTGTTGA GAAAGATTGA
 
Protein sequence
MPANSWPEND SDNGTSAPLN SLVSDLSAQQ ELVEDANAMS VLSHRFKRRT PKAALVLTIV 
WSGTIALHLV SWGYIFILGL TTILGIHALG IIFARPRHHH KEIHGDLPSV SLLVAAKNEE
AVITRLVKSL CSLEYANGEY EVWIIDDNST DNTPHLLAEL KQEYKHLKVF RRSAEDSGGK
SGALNQVLPM TKGDIIVVFD ADAQVNPDLL LQVVPLFQKE QVGAVQVRKA ITNAKENFWT
KGQMAEMAVD TWFQQQRTTI GGLGELRGNG QFVRRQALDG CGGWNEETIT DDLDLTIRLN
LDKWDIECMF YPPVQEEGVT NVIALWHQRN RWAEGGYQRY LDYWDLILQN RMGTRKTWDL
LIFLLIMYIL PTAAIPDLLM SLIRHRPPVL TSVTGLSVTM SFVGMFAGLR RTRQDQKTSN
YFVLLLQTIR GSIYMLHWLV VMSSTTARMS VRPKRLKWVK TVHTGVEKD
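
As a rough consistency check between the two sequences above, the sketch below translates a nucleotide string and computes its GC fraction. It uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names are hypothetical. Only the first and last lines of the gene sequence are used here, to keep the example short. Pasting in the full sequence should work the same way.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGCCAGCGA ATTCCTGGCC GGAAAACGAC TCTGATAACG GAACCTCTGC TCCTCTTAAC"
last_line = "ACCGTGCATA CAGGTGTTGA GAAAGATTGA"

print(translate(first_line))  # MPANSWPENDSDNGTSAPLN
print(translate(last_line))   # TVHTGVEKD
```

Both outputs match the start and end of the protein sequence listed above. Calling `gc_fraction` on the full 1410 bp sequence would give the value to compare with the reported GC content.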