Gene Aazo_1310 details


Gene Information

Locus tag: Aazo_1310
Symbol:
ID: 9339105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1386803
End bp: 1388083
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: group 1 glycosyl transferase
Protein accession: YP_003720706
Protein GI: 298490529
COG category:
COG ID:
TIGRFAM ID:
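
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are inclusive and that the gene span includes the stop codon, which is consistent with the gene sequence ending in TAA. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1386803
end_bp = 1388083
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are inclusive, so both end bases count.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is the stop codon and encodes no residue.
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.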


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCCAGA CTACACAAAA TCGGGTTGCG CTTATTTCCG TTGATGGTGA TCCATCTGCA 
AAAATTGGTC AAGAGGAAGC TGGGGGTCAG AATGTTTATG TGCATCAAGT AGGTATGGCT
TTGGCTGAAC AGGGTTGGCA AATGGATATG TTCACTCGTC GCAGTTGTCC GAGACAGACA
ACAATAGTAC CCCATCATCC CAACTGTCGC ACCATTCGGT TAAACACTGG GCCTGCAGAG
TTTATTGGGC GAGATCATTT GTTTGACTAT CTACCAGAAT TTCTAGCGGA ATTTCAAAAA
TTCCAACAGC AGCAAGGATT ATATTATCCC ATAGTGCATA CCAACTACTG GTTATCTGCT
TGGGTGGGTA TGGAACTGAA AAAACGTCAA CCACTGATTC AAGTGCATAC TTACCACTCT
CTAGGGGCAG TTAAATACAG AAGTGTGGGT CATATTCCCG TGATTGCTAT CCAAAGGTTA
GCAGTGGAAC AAGCTTGTTT GGAAACAGTA GACTGTGTGG TTGCTACCAG TCCCCAGGAA
CAGAAGCATC TGCGAATACT AGTTTCTTAC AACGGACGGA CAGAAATGAT TCCCTGTGGT
ACTGATCTTC AGAAATTTGG TGGACTTCCC AGGTTAGAGG CTAGGGAAAA GCTAGGAATT
ACCCCTGATG CCAAAATGGT TTTTTATGTT GGGCGTTTTG ATCAGCGTAA AGGAATTGAC
ACTCTGGTAA AAGCCGTTGC CCAGTCCACG TTTAGAGATG AGGGAAAGGT GAAACTGGTA
ATTGGTGGTG GTAGTTGTCC TGGTTACATT GATGGAATGG AACGCGATCG TATTACCACC
ATTGTTGCAG AACTAGGACT GGAAGATATA ACCATCTTTC CCGGTCGCCT AGATCATAGC
GTCCTCCCTT ACTATTACAG TGCTGCTGAT GTCTGCGTTG TTCCTAGTCA CTACGAACCC
TTTGGTTTAG TGGCTATTGA AGCAATGGCT AGTCAGACTC CCGTTGTTGC TAGTGATGTC
CGTGGGTTGC AATTTACAAT TGTACCAGAG GTTACAGGTT TACTCGCCCC TCCCAAAAAC
GACGTAGCTT TTGCAGCAGC TATTGACCGC ATACTTGCTA ATCCATCTTG GCGTGACGAG
TTAGGTGTAG CTGGACGCGA ACGGGTAGAA ATTGCTTTTA GCTGGAATAG TGTGGGTTCT
CGACTATCTC AGCTTTATCT GCGGCTGATG ATGCAAGCAG CAGAACAATA TCAAAACCAA
ACTCAGATTC TTGCGGCTTA A
 
Protein sequence
MFQTTQNRVA LISVDGDPSA KIGQEEAGGQ NVYVHQVGMA LAEQGWQMDM FTRRSCPRQT 
TIVPHHPNCR TIRLNTGPAE FIGRDHLFDY LPEFLAEFQK FQQQQGLYYP IVHTNYWLSA
WVGMELKKRQ PLIQVHTYHS LGAVKYRSVG HIPVIAIQRL AVEQACLETV DCVVATSPQE
QKHLRILVSY NGRTEMIPCG TDLQKFGGLP RLEAREKLGI TPDAKMVFYV GRFDQRKGID
TLVKAVAQST FRDEGKVKLV IGGGSCPGYI DGMERDRITT IVAELGLEDI TIFPGRLDHS
VLPYYYSAAD VCVVPSHYEP FGLVAIEAMA SQTPVVASDV RGLQFTIVPE VTGLLAPPKN
DVAFAAAIDR ILANPSWRDE LGVAGRERVE IAFSWNSVGS RLSQLYLRLM MQAAEQYQNQ
TQILAA
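
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own tooling. It relies on translation table 11 assigning the same amino acid to each codon as the standard code (the two tables differ in which codons may act as start codons). Only the first 60 bases and the last 21 bases of the gene are used, and the helper name `translate` is hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First line of the gene sequence block (60 bases).
first_60 = "ATGTTCCAGA CTACACAAAA TCGGGTTGCG CTTATTTCCG TTGATGGTGA TCCATCTGCA"
# Last line of the gene sequence block (21 bases, including the stop codon).
last_21 = "ACTCAGATTC TTGCGGCTTA A"

print(translate(first_60))
print(translate(last_21))
```

The two fragments translate to the first 20 residues and the last 6 residues of the protein sequence listed above, with the trailing `*` standing for the TAA stop codon.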