Gene Aazo_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2028 
Symbol 
ID9339821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2103856 
End bp2106384 
Gene Length2529 bp 
Protein Length842 aa 
Translation table11 
GC content43% 
IMG OID 
Productnucleotidyl transferase 
Protein accessionYP_003721210 
Protein GI298491033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGGAG TGCTCATGGC AGGTGGTTCG GGGACACGGT TACGTCCTTT AACTTGTGAT 
TTACCGAAGC CGATGGTTCC TATACTAAAT CGACCAATTG CTGAACATAT TATCAATCTA
CTCAAACGAC ATCACATTAC AGAAATTATT GCCACGTTAC ACTATTTACC AGATGTCCTC
CGAGATTACT TCCAAGATGG TAGTGATTTT GGGGTACAGA TGACCTATGC TATTGAAGAA
GACCAGCCTC TGGGTACAGC AGGTTGTGTA AAAAATATTG CTGAACTTTT GGACGAAACT
TTTTTAGTGA TTAGTGGCGA TAGTATTACA GATTTTGACC TCACTGCAGC CATTAAATTT
CACAAACAAA AACAGTCAAA AGCTACTTTA ATTTTAACCC GTGTTCCTAA CCCGATTGAA
TTTGGGGTGG TAATTACGGA TGAACAAGGA CGCATTAACC GATTTTTAGA GAAACCCTCG
ACTAGCGAAA TTTTTTCCGA TACAGTTAAC ACTGGTACTT ATATTTTAGA ACCAGAAGTT
TTGGAATATT TACCAGAACA CACAGAATCT GATTTTTCTA AGGATTTATT TCCCTTACTA
CTAGCAACAA ATGAACCTAT TTATGGTTAT GTAGCCCAAG GTTATTGGTG TGATGTGGGT
CATTTAGATG CTTATCGGGA AGCACAATAT GATGCTTTAG CCAGAAAGGT AAAACTGGAG
TTTGCTTATC AAGAAGCTTC TCCTGGGGTG TGGATAGGTC AAAATACTTA TATCGATCCT
AGCGCCAAGA TTCAAACTCC AGCTGTGATT GGTGATAATT GCCGGATTGG GGCAAGAGTT
CAAATTGACG ATGGAACGGT AATTGGTGAT AATGTCACTA TTGGGGCAGA TGCTAATTTG
AAGCGGCCTA TAGTTTGGAA TGGGGCGATT ATTGGGGATG AAGCCCAGTT ATCGGCTTGT
GTAATTTCCC GTGGTACTCG TGTAGATAGA CGTTCCCATG TATTAGAAGC TGCTGTAGTT
GGTTCGCTTT CTACGGTGGG AGAAGAGGCG CAAATTAGCC CTGGTGTGCG GGTTTGGCCG
AGTAAAAAGA TTGAGTCAGG TGCAATTTTA AACATTAACC TGATTTGGGG AAACACTGCC
CAACGGAACT TATTTGGTCA GCGTGGTGTA CAAGGTTTAG CGAATATTGA TATCAGCCCG
GAATTTGCGG TGAAGTTGGG GGCTGCTTAC GGTTCGACTT TAAAACCAGG TTCTAAGGTG
ACGGTTTCTC GTGATCAGCG TAATGTGTCG CGGATGGTAA CTCGTTCTTT AATTGCTGGT
TTGATGTCGG TAGGTGTGGA TATTCAAAAT CTTGATTCTA CTGCTATTCC TATTACTCGC
ACGGTGATCC CGATTATGGG GGTAGTGGGT GGTATTCATG TCCGTGTACA CCCAGACCGG
CCTGATTATA TCTTGATTGA ATTTATGGAT GGTAAAGGGA TTAATATTTC TAAGGCTCAG
GAAAAGAAAA TTGAGGGCGC GTATTTTAAG GAGGATATGC GGAGGGCGCA AAGTCACGAA
ATTGGTGATG TGGCCTATCC TAGCCAGGTG ATTGACCGCT ATTGTACTGC TTTCGAGAAG
CTGTTGAATG TTTCTACTCT TCGCAATAGT CGAGCAAAAG TTGTTATTGA CTATGTCTAT
GCGGTATCCG GGGCAGTGTT ACCGCAAATG CTAGATAAAT TTGGTGCTGA TGCGGTGGTA
TTAAATGCAA GTGTCAATAA AACCGCGATG ACAACTACTA GCCGGGAAGG ACTGCTGACT
CAGTTGGGTC ATGTGGTGGA AGCTCTGAAG GCTAATTTTG GGGTGCAGGT ATCAGCTAAT
GGGGAACAGT TGATTTTAGT GGATGAGTCT GGCTACCCAG TGCGGGGGGA AATCCTGACG
GCGTTGATGG TGGAAATGAT GTTAACGTCT AACCCTAGAT GCTCGGTAGT TGTGCCGGTT
CATGCTTCTA GTGCGGTGGA ACAAGTCGCG CGTCGTCATG ATAGTAAGGT AATTCGCACA
AAAGCAAATC CAACTGCTTT AATGGAGGCC TGTCAAAAAA ATCCCAATGT GGTTTTGGGT
GGTAGTGGGG AAACTGGTTT TATTTTCCCA CAATTGCATC CGGGGTTTGA TTCGATGTTC
TGCATTGCTA AGTTGATTGA AATGCTGACT ATTCAAGAGC GATCACTTGC ATCTGTGCGT
TCAGAATTAC CCCGTGTCAT TCACAAAGAT TATACCATTC GTTGTCCTTG GACTGCTAAA
GGGGCACTGA TGCGTTATTT GGTGGAAACT CACCCAGCCC AAAATTTGGA ATTAATTGAT
GGTGTGAAAA TTCGTCAACC CTATGATGAT AGTTGGGTGT TAGTTCTGCC CGATGCTAGT
GAACCAATGG TACATTTATT TGCTAACAGT AGCGATCGCG ATTGGGTTGA TGAGAGTTTG
AGAAGCTATC GCCATCGTGT TCAGACTTTT GTAGAAAGAG AACAGGAACA TTACACCGCA
GAAGTTTAA
 
Protein sequence
MRGVLMAGGS GTRLRPLTCD LPKPMVPILN RPIAEHIINL LKRHHITEII ATLHYLPDVL 
RDYFQDGSDF GVQMTYAIEE DQPLGTAGCV KNIAELLDET FLVISGDSIT DFDLTAAIKF
HKQKQSKATL ILTRVPNPIE FGVVITDEQG RINRFLEKPS TSEIFSDTVN TGTYILEPEV
LEYLPEHTES DFSKDLFPLL LATNEPIYGY VAQGYWCDVG HLDAYREAQY DALARKVKLE
FAYQEASPGV WIGQNTYIDP SAKIQTPAVI GDNCRIGARV QIDDGTVIGD NVTIGADANL
KRPIVWNGAI IGDEAQLSAC VISRGTRVDR RSHVLEAAVV GSLSTVGEEA QISPGVRVWP
SKKIESGAIL NINLIWGNTA QRNLFGQRGV QGLANIDISP EFAVKLGAAY GSTLKPGSKV
TVSRDQRNVS RMVTRSLIAG LMSVGVDIQN LDSTAIPITR TVIPIMGVVG GIHVRVHPDR
PDYILIEFMD GKGINISKAQ EKKIEGAYFK EDMRRAQSHE IGDVAYPSQV IDRYCTAFEK
LLNVSTLRNS RAKVVIDYVY AVSGAVLPQM LDKFGADAVV LNASVNKTAM TTTSREGLLT
QLGHVVEALK ANFGVQVSAN GEQLILVDES GYPVRGEILT ALMVEMMLTS NPRCSVVVPV
HASSAVEQVA RRHDSKVIRT KANPTALMEA CQKNPNVVLG GSGETGFIFP QLHPGFDSMF
CIAKLIEMLT IQERSLASVR SELPRVIHKD YTIRCPWTAK GALMRYLVET HPAQNLELID
GVKIRQPYDD SWVLVLPDAS EPMVHLFANS SDRDWVDESL RSYRHRVQTF VEREQEHYTA
EV