Gene Aazo_4314 details

Gene Information

Locus tag: Aazo_4314
Symbol:
ID: 9342119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4386384
End bp: 4388741
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: UvrD/REP helicase
Protein accession: YP_003722791
Protein GI: 298492614
COG category:
COG ID:
TIGRFAM ID:
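
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record states neither, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4386384
END_BP = 4388741
GENE_LENGTH_BP = 2358
PROTEIN_LENGTH_AA = 785


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 4388741 - 4386384 + 1 gives 2358 bp, and 2358 / 3 - 1 gives 785 aa, matching the listed values.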


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAG ATAATTTTAT GGTTATAGTC AATCAAGAAA GCCTGGAAAT AGAAGATATT 
CAGCCATCAC CAGCAGCTAA GTTACGAAAA AATATTGACA AAATTCGTCA TAGTCTGCGA
CCTGGACAAA AGCAAATGGC TAATTGGGAA TCGGGTCCAT TGGCTGTATC TGCTGTGCCT
GGTGCTGGTA AATCTACAGG AATGGCAGCA GCAGCAGCCA TAGCAATAGC TCGCCAGTAT
CAGTATTCCC TCACCACAGG TAATAATTAT CGGCGTCAGT TAGTAGTTGT CACCTTTACC
CGTTCTGCTG CTGCTAACCT AAAATTAAAA ATCCGCGATA AACTTAAGAA ACTATCCTTA
CCACAAACAG GCTTTGCTGT TTATACCCTC CACGGTTTAG CCTTAAATAT AGCCAATCGT
TATCCTGATT TATCCGGGTT GCAGTTAGAA AACGTCACAT TAATCACCCC TAATCAAACT
CATCGCTTCA TCCGCGCAGC CGTAGAACAA TGGATTACAA AAAATCCTGA ACTTTATCGA
CGCTTACTAG AAGGTCAAAA ATTTGATGGT GAAGAAACAG AAAAACTACG TCGTCAGTCT
GTGCTACGCA CAGAAGTATT ACCAGACTTA GCTTATACGG TCATTCACGA AGCTAAAAGT
TCGGGAATAT TACCGGAAAA ACTCTATAAA TGGAGTGAAA GAACCCAAGA TACCTATCAA
ATTTTACAAG TGGCAGCGGG ATTGTATGAA CAATATCAAA AATTAATGGC TTCCCAAGAT
TTCATTGACT ACGATGATAT GATGTTAGCC GCTTTACGCG TCTTAGAAAA TTCTAGCGCC
AGACGGATTG AACAAAATCA AATTTTCGCA ATTTTTGAAG ATGAAGCCCA AGATTCTAGC
CCACTACAAA CGCAGCTTTT AGAAATACTA GCTGGTGAAG AAGACGGCAA TAATGCAGAT
CAGTCAACAC TCAATTTAGT CAGAGTTGGT GATCCTAATC AGGCGATTAA TTCAACTTTT
ACACCTGCTG ATCCGATTTA TTTTCGGGAG TTTTGTAAGA ATTGTGATAT TAATCAACGA
CTAGCAACAA TGAATCAAGC TGGTCGTAGT AGCAAAATTA TTATTGATGC AGCCAACTTT
GCATTGGAGT GGGTAAATAG TCAATGGTCA ACCATAACTA AGAACGGACA AACACCATTT
CTCTCCCAGA AAATTGCGGC TGTTAATATG GGTGAACCCC AACTAAATGC TAATCCTGCA
CCATTTGGTA AAGGTCTGGA ATTGTATAGC CCTCGTGATA TTCATCATAC AGTTGAGTTG
CTTTCCCAGA AAGTAATTGA ATTATTTACT CAAAACTCTG ATTCCCGTGC GGCGGTATTA
GTCAGGGAAA ATCGTCAAGG GAGATGGTTA ACATCGGCTT TAGAAGCTAT TTGTCAAGAA
CATCAAATTA AACTTTATGA TGTGGGAGAA ATGGAAAGAC GTTCTCATGT TCCACAAGAA
ATTTTATCAT TATTGCAATT TTGCGATCGC CCGCATTCCT CTGATTATTT AAAAGCCACT
CTCCAGGTTT TAGTACAACG TCAATTAATT CCCATCCAAG ACATTAACTC CCTAGCCAGT
ATCCCAGAAC AATTTTTATA TCCTGGACCC CTAGCCACTC CTCAAACAGA AATAGTTCGA
AAAGCTGCTC ATTTGTGTCG CAGTTTACTT CATGCCCGTT TAGAACTACC CATATATCAA
ATTATTCCGT TTTTAGCCTT AACCTTAAAT TACGACCAAG CGGAGTTAGC CACCGCTGAC
AAACTTGCAG AACGGGTAAA CCTGCAAAAT TGGGGTAATA ATTCTATGGG TTCAATGCTG
TCGGCTTTAA GTGAAATCGT CAATTCTGAA CGCTTTGAAC CAGTAGATAC AGAAAATTCA
GAAGAACAAT ATACCAAAAA GGGACAATTA ACAATTACCA CTATGCACAA AGCGAAAGGG
TTGGACTGGG ATTATGTTTT CCTACCGTTT TTGCATGAAA ACTTAATTCC TGGTAAATTT
TGGGTTCCTC CCCAAAGCCA ATTTCTAGGC GATTTTACCT TATCAGAAGT AGCACGCGCC
CAAATTCGCG CCGGACTTCA TCAACAAACA GAAACTATAC CCAATGTAAG CCAAGCTTGG
GAACAAGCCA AATACCTTAA AATGGCCGAA GAATACCGTT TATTGTATGT TGCTATGACC
AGGGCAAAAA GACTTTTATG GATGTCTGCG GCGCATCAAG CACCATTTAC TTGGAGTAAA
CCTGATAGTT TACAAGCTTC AGCCCCTTGT CCGGTATTTG CCGCTCTAGA GCGGCAATTC
TCCAGCTATA CGAATTGA
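
The GC content field can be recomputed from a gene sequence laid out in blocks of ten bases like the one above. This is a minimal sketch, not the database's own method: the record does not say how the 40% figure was calculated or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (spaces and line breaks between blocks) is ignored,
    so a sequence can be pasted in exactly as it is displayed.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as displayed in the record.
    print(f"{gc_content('ATGTCAAAAG ATAATTTTAT'):.0%}")
```

Passing the full gene sequence to `gc_content` and formatting the result as a percentage gives a value to compare with the GC content field.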
 
Protein sequence
MSKDNFMVIV NQESLEIEDI QPSPAAKLRK NIDKIRHSLR PGQKQMANWE SGPLAVSAVP 
GAGKSTGMAA AAAIAIARQY QYSLTTGNNY RRQLVVVTFT RSAAANLKLK IRDKLKKLSL
PQTGFAVYTL HGLALNIANR YPDLSGLQLE NVTLITPNQT HRFIRAAVEQ WITKNPELYR
RLLEGQKFDG EETEKLRRQS VLRTEVLPDL AYTVIHEAKS SGILPEKLYK WSERTQDTYQ
ILQVAAGLYE QYQKLMASQD FIDYDDMMLA ALRVLENSSA RRIEQNQIFA IFEDEAQDSS
PLQTQLLEIL AGEEDGNNAD QSTLNLVRVG DPNQAINSTF TPADPIYFRE FCKNCDINQR
LATMNQAGRS SKIIIDAANF ALEWVNSQWS TITKNGQTPF LSQKIAAVNM GEPQLNANPA
PFGKGLELYS PRDIHHTVEL LSQKVIELFT QNSDSRAAVL VRENRQGRWL TSALEAICQE
HQIKLYDVGE MERRSHVPQE ILSLLQFCDR PHSSDYLKAT LQVLVQRQLI PIQDINSLAS
IPEQFLYPGP LATPQTEIVR KAAHLCRSLL HARLELPIYQ IIPFLALTLN YDQAELATAD
KLAERVNLQN WGNNSMGSML SALSEIVNSE RFEPVDTENS EEQYTKKGQL TITTMHKAKG
LDWDYVFLPF LHENLIPGKF WVPPQSQFLG DFTLSEVARA QIRAGLHQQT ETIPNVSQAW
EQAKYLKMAE EYRLLYVAMT RAKRLLWMSA AHQAPFTWSK PDSLQASAPC PVFAALERQF
SSYTN
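
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the amino-acid string NCBI publishes for that table (codons ordered with bases T, C, A, G); it assumes the sequence is shown in the coding direction and starts in frame, and `translate` is an illustrative name rather than part of any library.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. An incomplete codon at
    the end of the input is skipped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGTCAAAAG ATAATTTTAT GGTTATAGTC"))  # MSKDNFMVIV
```

The first three blocks of the gene sequence translate to MSKDNFMVIV, the first block of the protein sequence, and the final bases TCCAGCTATA CGAATTGA translate to SSYTN followed by the TGA stop codon.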