Gene Aazo_3351 details

Gene Information

Locus tag: Aazo_3351
Symbol:
ID: 9341156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3420617
End bp: 3421951
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: O-antigen polymerase
Protein accession: YP_003722136
Protein GI: 298491959
COG category:
COG ID:
TIGRFAM ID:
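
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3420617, 3421951)
print(length, protein_length_aa(length))  # 1335 444
```

Under these assumptions the values match the Gene Length (1335 bp) and Protein Length (444 aa) fields.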


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGGAG CCAGCTTGAA CAAGGCTTTT TATCATCCAG ATTCTAGTTT GCAAATCCCT 
TGGAACAGTC TTCAATTTGG GTTACTTGTC TTCCCACTCA ATCCATTTTT GGGGGCTGTT
ACAATATCGT TGGCAGCATT AATTACCTGG GCGAAAAAAT ACCGCCTAAT TATCCGCAGA
CCACTCCACC AAGGATTTGC AGTTCTCAGT TTATTGTTGT TGATAACCAC AGGGTTTGCT
TCTCATAAAC TAGAGGCTTT CCTCGGTTTA TTCAACTTAT TACCTTATTT TTTCGTATTT
GCTGCATTGA CCGCCCTAAT CCAAACATCA GCCCAATTAA GGCAAATAGC TTGGATTATA
GTATTTGGTT CTTTACCTAT GGTAATTATC GGCTTTGGGC AATTATTTCT AGGCTGGAGT
TTCAAATTAC AATTTCTCTG GGAGTTGATA GATTCAACAC TTTCCCCTGG AGGAAAACCA
CCAGGACGTA TGGCTTCTGT ACTGATGCAC GCCAACACTT TAGCCGCTTA TTTGGTGACT
ATTTTTATTT TGGGTTTAGG GTTGTGGTTA GAAAACTATC AAAAACTCAA ACAAAAACTC
AAAGTCAAAA ATCTTCCTAT CTCTTCATCT ACCTACGGAC CCGTTATTTT TCTGACAATA
GCAGTATTTG CTAATTTCAT CGCTTTGATT TTAACCAACT CCCGCAATGG CTGGGTAATC
GCTATTATTA CCTGTTTAGC TTATGCCTTG TACCAAGGTT GGCGGCTAAT TGTGGCTGGT
TGTATGAGTA TAGCCACGGT TATTCTTTTA GCAGCTTTTG CACCTTCTGC CATAGCTCAA
TTTTTCCGTC GCTTCGTTCC CTATTTTATC TGGGCGCGGT TAAATGATGA TATCTATCCT
GATAGACCAG TGGCCTTAAT GCGAAAAACC CAATGGGAGT TTGCCTGGAA TTTAACACAA
CAGCATCCTT TTACTGGTTG GGGTTTACGC AGTTTTAGTG GACTCTACAA AGCGAAAATG
GCAACTGACT TGGGTCATCC CCACAACCTC TTTTTAATGC TATCTGCGGA AACTGGTTTA
ATTACAACTT TTTTATTTTC TGGAATACTC GCTGGGATTT TATTTACCGC CAGTCAACTT
CTATGGAAAT CACAATCTTT AGAACCAGAA AACAGATTAA TATTTTTTAG TTATCTACTA
GCTGTTATTT CTTGGATACT ATTTAATACG GTAGATGTTA CCACTTTCGA CTTGCGTTTA
AGTATACTGT CTTGGGTGTT CATAGCCGCT TTATGTGGAG TAATTTTTCA CCAGAATTCA
CTAATTAAGA GGTAA
 
Protein sequence
MLGASLNKAF YHPDSSLQIP WNSLQFGLLV FPLNPFLGAV TISLAALITW AKKYRLIIRR 
PLHQGFAVLS LLLLITTGFA SHKLEAFLGL FNLLPYFFVF AALTALIQTS AQLRQIAWII
VFGSLPMVII GFGQLFLGWS FKLQFLWELI DSTLSPGGKP PGRMASVLMH ANTLAAYLVT
IFILGLGLWL ENYQKLKQKL KVKNLPISSS TYGPVIFLTI AVFANFIALI LTNSRNGWVI
AIITCLAYAL YQGWRLIVAG CMSIATVILL AAFAPSAIAQ FFRRFVPYFI WARLNDDIYP
DRPVALMRKT QWEFAWNLTQ QHPFTGWGLR SFSGLYKAKM ATDLGHPHNL FLMLSAETGL
ITTFLFSGIL AGILFTASQL LWKSQSLEPE NRLIFFSYLL AVISWILFNT VDVTTFDLRL
SILSWVFIAA LCGVIFHQNS LIKR
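
The sequences above are printed in blocks of ten characters, so spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that clean-up, together with a GC-content calculation and a translation check. It is not the database's own method: the helper names are hypothetical, and the codon table is built from the NCBI amino-acid assignments for translation table 11 (identical to those of table 1). Alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# NCBI amino-acid string for translation table 11; codons are ordered
# TTT, TTC, TTA, TTG, TCT, ... with the first base varying slowest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence give the first ten residues.
first_blocks = clean_sequence("ATGTTGGGAG CCAGCTTGAA CAAGGCTTTT")
print(translate(first_blocks))  # MLGASLNKAF
```

Passing the full cleaned gene sequence to `gc_percent` and `translate` allows a comparison with the GC content field and the protein sequence listed in the record.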