Gene Aazo_5204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5204 
Symbol 
ID9343011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5328898 
End bp5329992 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content39% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003723364 
Protein GI298493187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAG TAAATAATTC CTTCTCTAAT CAATTAATAA TAAATTTATC TATTATCTTC 
TCTCAACCAA CCGGCATCAG CAACTATGCT CTAAATTTAT TTCCCTATTT ACAATCTCTC
CAACCTACCC TATTAACAGC GCAAAAATAT TCTGAATTCA ACTGCTACCC AGTCCCAAAT
GATCTTACTC CTGCTGACGG TATTAAAGGA CATTTACGCC GGCTAATGTG GACACAATTT
CAACTGCCAA AGATATATAA AAACTTTAAA TCTCAACTTT TATTTTCCCC CATACCAGAA
GCACCTCTTT ACAGTAACTG TCGTTTTATC ATCATGTCTC ATGACATGAT ACCATTACAC
TTTCCCAAAC CATTTTCACC GCTAACACCA TACCACCGTT ACTATACTCC CCAAGTGTTT
AAGGAAGCAC AACATATTAT TTGTAACTCA GAAGCAACCG CTAAAGACAT CACCGAATTT
TACCAAATAC CCACCAGTAA AATCACACCT ATTCTCCTAG CACACAATCG CACTCACTTC
CGTTGTCTGA ACCTACCCAC CAGTAATTAC TTCCTATACA TCGGTCGTCA AGACCCTTAC
AAAAACTTGC AGCGACTCAT CAGTTCCTTT GCTGCGCTAC CTAATAAGGG AGATTATGAA
CTATGGTTAG CAGGTCCCAC TGATAAACGT TACACCCCAT TATTGCAAGC GCAAGTTGAA
GAACTGGGTA TCACTCATCG TGTCAAATTC CTCAACTACG TACCTTACAG TGAACTACCA
ACAATCATAA ATCAAGCAGT TGCTCTCGTT TTTCCGAGTT TGTGGGAAGG GTTTGGTTTT
CCTGTCCTGG AAGCAATGGC TTGTGGAACT CCCGTTATTA CCTCTAATCT TTCTTCACTT
CCCGAAGTAG CTGGTGATGC TGCTATTCTC ATTAATCCTC ATAACACAGG GGAAATCACA
GAAGCAATGC AAGCAATTAT CAATGATTCA GGAATGAGAA AACAACTTTG TCAAAAAGGC
ATAGAGAGAG CAAATTTGTT TAGCTGGGAA AAAACCGGAC TTGCTACAGC AGAAGTTTTA
AAACAATATT TCTGA
 
Protein sequence
MNRVNNSFSN QLIINLSIIF SQPTGISNYA LNLFPYLQSL QPTLLTAQKY SEFNCYPVPN 
DLTPADGIKG HLRRLMWTQF QLPKIYKNFK SQLLFSPIPE APLYSNCRFI IMSHDMIPLH
FPKPFSPLTP YHRYYTPQVF KEAQHIICNS EATAKDITEF YQIPTSKITP ILLAHNRTHF
RCLNLPTSNY FLYIGRQDPY KNLQRLISSF AALPNKGDYE LWLAGPTDKR YTPLLQAQVE
ELGITHRVKF LNYVPYSELP TIINQAVALV FPSLWEGFGF PVLEAMACGT PVITSNLSSL
PEVAGDAAIL INPHNTGEIT EAMQAIINDS GMRKQLCQKG IERANLFSWE KTGLATAEVL
KQYF