Gene Aazo_5072 details


Gene Information

Locus tag: Aazo_5072
Symbol:
ID: 9342881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5196353
End bp: 5198170
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: family 39 glycosyl transferase
Protein accession: YP_003723290
Protein GI: 298493113
COG category:
COG ID:
TIGRFAM ID:
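
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5196353
end_bp = 5198170

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the last codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1818
print(protein_length)  # 605
```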


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTAA AATTGAGCAA TCGCTCCTCT GTTGACCAAT GGATTAATAA GATAGAACAG 
CGTCCAGCCC TTGCTGTGAC TATTTCAATA GTGTGGTTGC TGTTGATTAA TTACATCGCC
TTTGTCTGGA ATTTGGGCAA TATTGGCTTA ATTGACGAAA CTGAGCCGCT GTTTGCAGAA
GCTTCCCGGC AAATGCTAGT TACAGGTGAT TGGATTACAC CCTTTTTTAA TGGTGAAACT
CGTTTTGACA AACCAGCGTT AATTTACTGG TGTCAAGCGC TCGCCTACTC TATTATGGGG
GTGAATGAAT GGGCAGCACG CATACCCTCG GCATTAGCAG CAACGGGTGT GACAGCTTTG
GCATTCTACG GTATACACTG GCATTTTGCC AAAAAAGATC AATTAGAGCA AGTTGCAAAT
CCTAATCGTC GTTACTTAAC AGCAGCTATT GCATCAGCTT TAATGGCACT CAATCCCGAA
ATGATTGTTT GGGGGAGAGT TGGTGTTTCC GATATGTTAC TCACCGGTTG TATAGCCTCA
GCTTTGCTTT GCTTCTTTTT GGGATACGCT CAAAATTCTT CCCCTTCTCC CTTCCCCAAT
AAATGGTATC TGGCTTGTTA TGTATTGATG ACCGGAGCAA TTTTAACCAA AGGACCAGTG
GGAATAGTTT TACCAGGATT AATTATGATT GCCTTTGCCC TATACTTAGG CAAATTCTGC
GAACTGTGGC GAGAAATGCG CCCGATTTTG GGCATGGGAA TAGTCTTCGC TTTATCTGCT
CCCTGGTACA TCTTGGTGAC TTGGCGCAAC GGCTGGAATT TTATTAATAC CTTTTTTGTT
TATCACAACA TAGAACGCTT TACAGAGGTT GTGAATGGTC ACTCAGCCCC TTGGTATTTT
TATTTTTTGG TAGTATTGTT GGGTTTTGCA CCATATTCAG TTTTTATACC TATGTCCATA
GCCAGGTTAA AATTTTGGCA GCGCTCGCAC TGGAAAAATC AGGAACGTTC TCAACAATTG
GGTTTATTTG CCTGTTTCTG GTTTTTGGGT GTATTTTGCT TTTTCACCAT CTCCGTCACC
AAACTCCCCA GTTACGTATT ACCTTTAATG CCAGCAGCAG CCATTCTTGT AGCCTTATCT
TGGAGTAACC TGTACCCAAA CACACAAACT CCTCAAGCTT TCCACATCAG TAGTTGGGTG
AATGTGGCTT TTCTCTCAAC ACTTGGAGTG GCATTATTCA ACATATCCCA CATTATCGGC
AAAGACCCCG CTGCACCTGA ATTGTACGAA CAAATACAAA ATTCAGGAAT GGCTAATGTG
GGTGGTATAA TTTGGCTGAC TGGTGCTGTA ATTATCGCTA TTTTGATCCT CTCTTACCGT
TGGCGTGCCA TCATTACTAT TAATTTGGTG GGTTTCGTAG CATTTTTATC ATTGGTTTTA
ATGCCTGCTT TATTCTTGAT GGATCAAGAG CGTCAGGAAC CTTTAAGACA ATTATCTGCG
CTCGCAGTCA AAGAAAAACA ACCCAATGAA GAATTAGTCA TGGTCGGTTT CAAAAAACCG
ACCGTCACTT TCTACACTCA AAACAAAGTT AATTACCTGG AATTTTCCCA ACAAGCTTTA
GACCATATTT ACAATCAAGC AGCCAACAAA ACACATCCAG CATCACTGCT ACTTCTGACC
GAGCAGAAAA AGTTAATTGA TATGAACTTA CCACCAGATA TTTATAAAAA TATCGCCACC
AAAGGAGCTT ATAATCTCAT TCGTATTCCC TTGCAGAGAA TTAAACAAAA CAAAAAGGAA
AAAACAGACA TTTCGTAA
 
Protein sequence
MRLKLSNRSS VDQWINKIEQ RPALAVTISI VWLLLINYIA FVWNLGNIGL IDETEPLFAE 
ASRQMLVTGD WITPFFNGET RFDKPALIYW CQALAYSIMG VNEWAARIPS ALAATGVTAL
AFYGIHWHFA KKDQLEQVAN PNRRYLTAAI ASALMALNPE MIVWGRVGVS DMLLTGCIAS
ALLCFFLGYA QNSSPSPFPN KWYLACYVLM TGAILTKGPV GIVLPGLIMI AFALYLGKFC
ELWREMRPIL GMGIVFALSA PWYILVTWRN GWNFINTFFV YHNIERFTEV VNGHSAPWYF
YFLVVLLGFA PYSVFIPMSI ARLKFWQRSH WKNQERSQQL GLFACFWFLG VFCFFTISVT
KLPSYVLPLM PAAAILVALS WSNLYPNTQT PQAFHISSWV NVAFLSTLGV ALFNISHIIG
KDPAAPELYE QIQNSGMANV GGIIWLTGAV IIAILILSYR WRAIITINLV GFVAFLSLVL
MPALFLMDQE RQEPLRQLSA LAVKEKQPNE ELVMVGFKKP TVTFYTQNKV NYLEFSQQAL
DHIYNQAANK THPASLLLLT EQKKLIDMNL PPDIYKNIAT KGAYNLIRIP LQRIKQNKKE
KTDIS
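
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The sketch below is a minimal, dependency-free illustration, not the database's own method. It builds the codon table from the NCBI-style amino acid string (for sense and stop codons, table 11 matches the standard code) and translates only the first 30 nt and the final 18 nt of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G at each codon position.
BASES = "TCAG"
# Amino acids for table 11 (identical to the standard code for these 64 codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks and the final line of the gene sequence above.
first_block = "ATGAGGCTAA AATTGAGCAA TCGCTCCTCT"
last_block = "AAAACAGACA TTTCGTAA"

print(translate(first_block))  # MRLKLSNRSS
print(translate(last_block))   # KTDIS*
```

The two outputs agree with the start (MRLKLSNRSS) and end (KTDIS) of the listed protein sequence, with `*` marking the terminal TAA stop codon.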