Gene Aazo_4868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4868 
Symbol 
ID9342675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4981585 
End bp4983063 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content36% 
IMG OID 
Productfamily 39 glycosyl transferase 
Protein accessionYP_003723137 
Protein GI298492960 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCGA TATTCATCCT GTCATTTAGC TTACGATTTT GGGGATTAGA CCGATTTAAT 
ACCCTCGTAT TCGATGAAAT TTACTTTGCT CAATTTGGTA ATAACTATCT TACCCATACA
CCCTTTTTCA ATGCTCATCC TCCACTGAGT CAATATATAA TTGGTCTGGG CATTTGGATT
GGTAATCATA TTCCTTTTTG GCAACATACT GTCAATGGGT CAACAGGTTC TTTATTATCT
CCTTGGAGTT ATCGTTGGGC AAATGCTTTT GCAGGTTCAT TGATTCCTTT AATAGTTATT
TTACTAACTT ACCAGTTAAG TTATCGTCGT GGTTTTGCTT TATTAGCAGG ATTTATTACG
GCTTGTGATG GTTTGTTATT GGTAGAATCT CGCTATGCTT TAAGTAATAT TTATATAGTT
TTGTTTGGTT TGATTGGACA ATGGTTTTTT CTGTTAGCTT TAGGAAAGCA AAATAAACAA
CGTTGGTTTG CTTTAGTTAT TGCTGGGATT AGTTTTGGCG CTTCCGTTGG TACAAAATGG
AATGGTTTAT GGTTTCTAGT GGGTACCTAT GGGATGTGGA TAGCAGCTTG GATAATTCGG
TGGTTGCAAT CTTTTGCACC GTCATCTCGT GTTCCCTTAT TTTCTTATCT ACATCCTTTA
ATTAATTCTA CAAACCCTCA ATTTAACAGT GATGAAATTA ATATTCAAAC CCCACTACAA
AACCTAACTC AAATCAATAT TGTACAGATG TTATCCTGTT TGGGAATTAT TCCCGCAGCA
ATTTATAGTC TAATTTGGAT TCCTCACCTA CAACTAGATA CAAGGTATGG ATTTATAGAG
GTTCATAAAC AGATTCTACA GTTTCATCTT CAGTTGGGTG GTAATAGCTC TAGTGTACAT
CCTTACTGTG CAGCTTGGTA CAAATGGCCG TTATTAACTC GACCAATGGC TTATTATTAT
CAAACAGCTC AGAGTTTTAA AGATCCCCTT CCTGTATTTG GACCACCCTT ACCTCCTGGT
GCGGGGAAAG TTGTTTATGA TGTCCATGCG ATGGGTAATC CCTTTTTATG GTGGTTCGGC
TTGTCAGCGA TGATATTTTT AATAGGGATG CTATTTGCAA AAATAGTTAT ATCAGGGATA
CAACAAAAAC GGGTATTTAT TCCCAAAAAT ATGGGTGTTG ATACTTGGAT TGGTTTATAT
ATAGTTATCA ACTATGCTGC TAATTTTTTA CCTTGGGTGA AGGTAAATCG CTGTGTCTTT
ATATACCATT ATATGTGTGC AGTGGTTTTT ACATTTATTG CGCTCGCATG GTTTATTGAT
CAGTGTCTTC GTAGCTATTA TCCGAAACTG CGTATACTTG GTCTAACAAT AACCTTCACT
ATCATGGTTG CTTTCATTTT TTGGATGCCC ATTTATTTGG GGTTACCTAT TTCTACTGAT
GATTATAGAA TGCGGATGTG GTTTAATTCT TGGATTTGA
 
Protein sequence
MTAIFILSFS LRFWGLDRFN TLVFDEIYFA QFGNNYLTHT PFFNAHPPLS QYIIGLGIWI 
GNHIPFWQHT VNGSTGSLLS PWSYRWANAF AGSLIPLIVI LLTYQLSYRR GFALLAGFIT
ACDGLLLVES RYALSNIYIV LFGLIGQWFF LLALGKQNKQ RWFALVIAGI SFGASVGTKW
NGLWFLVGTY GMWIAAWIIR WLQSFAPSSR VPLFSYLHPL INSTNPQFNS DEINIQTPLQ
NLTQINIVQM LSCLGIIPAA IYSLIWIPHL QLDTRYGFIE VHKQILQFHL QLGGNSSSVH
PYCAAWYKWP LLTRPMAYYY QTAQSFKDPL PVFGPPLPPG AGKVVYDVHA MGNPFLWWFG
LSAMIFLIGM LFAKIVISGI QQKRVFIPKN MGVDTWIGLY IVINYAANFL PWVKVNRCVF
IYHYMCAVVF TFIALAWFID QCLRSYYPKL RILGLTITFT IMVAFIFWMP IYLGLPISTD
DYRMRMWFNS WI