Gene Aazo_4917 details


Gene Information

Locus tag: Aazo_4917
Symbol:
ID: 9342724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5032283
End bp: 5033551
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: group 1 glycosyl transferase
Protein accession: YP_003723176
Protein GI: 298492999
COG category:
COG ID:
TIGRFAM ID:
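
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, introduced only for illustration.

```python
# Values copied from the gene record above.
START_BP = 5032283
END_BP = 5033551


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1269 422"
```

Both results agree with the Gene Length and Protein Length fields of the record.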


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGGATC TCATTTTAGT ATTAGAAATA GATTCCTCAT CCATGCCTAC AGAACCTTAT 
AACTGGATAT CCTCCCAGAT TGGAGCCCGT GAGCATTACA CCATACCTAG AGCTTTACAA
CAGCAGGGAC AGCTTACTCA CTTGATTACA GATGCTTGGG TATCACCTCA ATCTGCACTA
AATTATTTAC CTAAATCTGT CTTAGCTAAT TTACGGGAAA GATACCACCC AGACTTAAAT
CAAGCATCTG TTCACGCTTT CAATAGTTCT TTAATTCGAT TTGAATTAAC TCAACGTTTG
CAAAAGAAGA TAGGATGGGA ACGAGTTATT GCTCGTAATC AATGGTTTCA ACAACAGACT
ATCAAGACCT TAGCAGGGCT ATCTGACCAG ATTACAACTC CTCTCATACT ATTTTCCTAC
AGCTATGCGG CTTTAGACAT TTTTCGATTT GCAAAACAAC AGGGATGGTA TACAGTCCTA
TGTCAAATTG ACCCAGGTTT GTGGGAAGAA AAAATGGTTA CTCAAGAGTA TGAACGATAC
CCCCAATATC GTGCAAACTG GCAACCAGCA CCTCCTGAAT ATTGGCAAAC CTGGAGAGAA
GAATGTACTC TAGCTGATGT TATTGTGGTC AATTCCAATT GGTCGAGTCA ACTCCTAGAA
AAAACAGGTG TTGAGCCTAA AAAAATTCAT ATCATTCCAT TAGTATATAC TCCACCAGAG
GCAGCTAGTA ACTTTGTCCG TACTTATCCA GAATTATTTT CTCAAGAGCG TCCTTTAAGG
GTATTATTTT TGGGACAGGT AATTTTGCGA AAAGGAATTG CGGCTGTATT AGATGCTGTT
CAATTACTTG AGGGATTTGC TGTTGACTTC TGGATTGTCG GCTCTGTGCA AATTGAGATT
CCATCTCATT TCCAAAACCA TCCGCAAATT CGTTGGTTAG GCCATGTTAA CAGGAGTAAA
ACAGCACAAT ACTACCAGAG GGCAGATGTG TTTTTGTTCC CGACTATTTC CGATGGGTTT
GGATTAACTC AACTGGAAGC ACAAGCATGG AAATTACCTA TTATTGCTTC TCGTTCCTGT
GGTGAGGTAG TTGTAGATAA TGTGAATGGT TGGATTTTGG AGGAAGTTAG TGGCAATAAA
ATTGCTAATT TGATACAGTC TATTTTGAGA GATTCAGCGC AATTACGATA TTTGTCTAAT
GGTTTAGCTT CACCATTAAA GTTTAGTCTT GCAAATTTAT TTCAGTCATT ACAAGATTCC
ATTCTCTAA
 
Protein sequence
MEDLILVLEI DSSSMPTEPY NWISSQIGAR EHYTIPRALQ QQGQLTHLIT DAWVSPQSAL 
NYLPKSVLAN LRERYHPDLN QASVHAFNSS LIRFELTQRL QKKIGWERVI ARNQWFQQQT
IKTLAGLSDQ ITTPLILFSY SYAALDIFRF AKQQGWYTVL CQIDPGLWEE KMVTQEYERY
PQYRANWQPA PPEYWQTWRE ECTLADVIVV NSNWSSQLLE KTGVEPKKIH IIPLVYTPPE
AASNFVRTYP ELFSQERPLR VLFLGQVILR KGIAAVLDAV QLLEGFAVDF WIVGSVQIEI
PSHFQNHPQI RWLGHVNRSK TAQYYQRADV FLFPTISDGF GLTQLEAQAW KLPIIASRSC
GEVVVDNVNG WILEEVSGNK IANLIQSILR DSAQLRYLSN GLASPLKFSL ANLFQSLQDS
IL
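
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that layout, compute GC content, and translate the gene is shown below. It assumes the codon assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and that the first codon is an initiation codon read as methionine, which is why the leading GTG yields M. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# The amino-acid string is the one used for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    residues = [CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: the first codon is an initiator
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence, as printed in the record.
print(translate("GTGGAGGATC TCATTTTAGT ATTAGAAATA"))  # prints "MEDLILVLEI"
```

Pasting the full gene sequence into `translate` and `gc_content` would allow a comparison with the listed protein sequence and the GC content field.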