Gene Aazo_4404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4404 
Symbol 
ID9342206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4484284 
End bp4487784 
Gene Length3501 bp 
Protein Length1166 aa 
Translation table11 
GC content41% 
IMG OID 
Producttranscription-repair coupling factor 
Protein accessionYP_003722842 
Protein GI298492665 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTTT CTTCTATTGT GCGTGCCTTG GCGCGATCAC CTCTCACCGC CGAACTAATT 
ACCAAACTCA AAAAGTACCA AGAATTGCGG TTAAATGGCA TTTCCCGTTT ACCCAAGGGT
CTGGTAGCTT CAGCTTTAGC CAACAATGAG GGTAGGGATT TGTGCGTGGT CTGTGCCACT
CTCGAAGAAG CCGGACGGGT TTATGCCCAA ATGGAAGCGA TGGGATGGAA AACTGTGCAT
TTTTACCCTG CCGCAGAAGC TTCTCCCTAT GAACCTTTTG ACCCGGAAAC TGAGTTAAGT
TGGGGACAAA TGCAGGTTTT AGCCGATTTG GTCAATGGTC AATGGTCAGT CGTCAGTAGT
CAGTTACCTA ATAGGAATAC GGCCATTATT GCTACTGTAG GCGCATTACA ACCGCATTTA
CCACCTCTAG AGGTATTTAG ATCTTTTTGT CTGTCCTTAA AAAAGGGTCT GGAATACGAT
TTAGATGAGT TCAGTGAAAA AATCACCAGT TTGGGCTATG AACGGGTGCC GTTGGTGGAA
ACGGAAGGAC AATGGAGTAG ACGGGGAGAT ATTGTGGATG TGTTTCCTGT TTCCTCGGAG
TTACCAGTGC GGTTGGAATG GTTTGGGGAT GAAATCAAAC AAATTCGGGA ATTTGATCCC
GCCACTCAAC GTTCTGCTTT AGATAAAGTT GAACAAATAT TTCTTACCCC AACAAGTTTT
TCAGGGATTG TTTTGGCAGC ACTCAAACAG AGTTCTGAGT TTCGTGTGTT GAGTGCTGAC
TTAAATTCTG ATGTAGATGA TTTGGAAAAT TTAGGACTGG AGGGAAGCAG GCGTTTTCTA
GGGTTAGCTT TTGCCAAACC CGCTTCACTT TTAGATTATT TATCAGCAAA TACCCTTATT
GCCATTGATG AGGTAGAACA GTGTCATGCC CATAGCGCTC GCTGGGTGGA AAATGCGGAT
AGTCAGTGGA GACTTGGGAC TGGGGAAGAA TCTCAGCAAG TTCCCAAAAT TCATCGGACT
TTTAATGAGT GTTTAGATGA AGCTGGAAAT TTTCAAAAGT TATATTTATC AGAATTAGCG
GAAGAAAACA GCGGTACTAA TTTAGCTAGT AGACCTTTAC CTGTAACACC TCACCAGTTT
GCCAAGTTAG CTGAGAAAAT TAGACAGGAA CGCGATCGCA AGTTTGCAGT TTGGCTAATT
TCTGCTCAAC CTTCCCGTTC TGTTTCTCTA CTCCAAGAAC ATGATTGTCC TGCCCAGTTT
ATCCCTAATC CCCGTGATTA TCAAGCGATT GGTAAACTAC AAATAAATCA TACGCCTATT
GCACTCAAAT ATTCTGGTTT AGCAGAATTA GAAGGTTTCA TTTTACCTTC TTTCCGTCTT
GTTATTGTTA CCGATCGCGA ATTTTATGGA CAACATTCCT TAGCGGATTT TGGTTATGTC
CGCAAACGTC GCAAAGCTAT ATCTAAACAA GTTGATACCA ATAAATTACG ACCAGGAGAT
TTTGTTGTTC ATCGTAGTCA CGGCATTGGT AAATTTGTCG AACTAGAAAG TTTAACAATT
AATAATGAAA CCCGTGATTA TTTAGTTATT CAATATGCCG ATGGTTTGTT GAAAGTTGCT
GCTGATAAAG TTGGTTCTCT ATCTCGGTTT AGAACCAGTG GAGATCAAAC ACCCGCACTA
CATAAAATGA CGGGTAAAGC TTGGGATAAT ACCAAGAATA AAGTCCGTAA GGCGATTAAG
AAATTAGCAG TAGATTTGTT AAAGTTGTAT GCTGCGCGAT CGCAACAACA AGGGTTTGCT
TATCCCGCAG ATATGCCTTG GCAGGAAGAA ATGGAAGATT CTTTTCCATA CCAAGCTACC
ACCGATCAGC TAAAAGCGGT GCAAGATGTG AAACGGGATA TGGAAAGTGA AAGACCGATG
GATCGGTTAG TGTGTGGTGA TGTGGGTTTT GGTAAAACTG AAGTTGCAAT TCGGGCTATT
TTCAAAGCTG TTACCGCAGG TAAACAAGTC GCAGTTTTAG CACCAACAAC TATTTTAACC
CAACAACATT ATCACACAAT CAAAGAACGT TTTTCACCTT ATCCTGTAAA TGTGGGTTTA
CTTAATCGTT TTCGTAGTGC GGAAGAAAAG CGTAATATTC AAAAGCGTCT GGCTACTGGA
GAATTAGATA TTGTTGTAGG GACACATCAA TTATTAGGTA AAGGTGTACA ATTTCGAGAT
TTGGGACTGT TGGTAATTGA TGAGGAACAA AGATTTGGTG TGAATCAAAA GGAAAAAATC
AAAAGCCTGA AAACTCAAGT TGATGTTTTA ACTCTTTCTG CGACTCCCAT TCCCAGAACT
TTGTATATGT CTTTATCTGG CATTCGGGAA ATGAGTTTAA TTACCACACC ACCTCCCACC
AGAAGACCAA TTCAAACCCA TCTTGCACCT TTAAACCCGG AAATTGTCAG AAGTGCAATT
CGGCAAGAAT TAGATAGAGG AGGGCAGGTT TTTTATGTAG TTCCGCGAGT GGAGGGAATT
GAAGAAACAA CTGCAAATTT GCGAGAAATG ATACCGGGGG GAAGATTTGC AATCGCACAC
GGTCAAATGG AAGAAAGCGA GTTAGAATCA ACCATGCTCA CCTTCGGAAA TAATGACGCT
GATATCCTAG TTTGCACAAC AATCATTGAA TCCGGTTTAG ATATTCCGCG AGTTAACACA
ATTTTAATTG AAGATGCTCA CCGTTTTGGT TTAGCTCAAT TATATCAATT ACGTGGTCGT
GTGGGACGTG CAGGAATACA AGCTCACGCA TGGTTATTTT ATCATAAACA GCGTGAATTA
TCCGATGCAG CCAGACTAAG ATTAAGAGCA ATTCAAGAAT TTACCCAACT CGGTTCTGGA
TATCAATTAG CAATGCGAGA TATGGAAATT CGTGGTGTGG GTAACTTGTT AGGTGCAGAA
CAGTCAGGTC AAATGGATGC AATCGGATTT GACTTGTACA TGGAAATGTT AGAGGAAGCA
ATTCGAGAAA TTAGAGGTCA AGAAATACCC AAAGTTGATG ATACCCAAAT TGACCTTAAT
CTCACCGCGT TTATTCCTTC CACTTACATT ACCGATATTG ACCAAAAAAT GAGTGCTTAT
CGTGCAGTAG CAACTGCAAA ATCCAAAGAA GAATTAAAAT CCATTGCCGC GGAATGGACT
GATAGATATG GAACTATACC AGTTCCAGCA AATCAACTCT TGCGAGTTAT GGAATTAAAA
CAATTAGCGA GAAATATAGG ATTTAGCCGC ATTAAACCTG AGAACAAACA GCATATTGTG
TTAGAAACAC CAATGGAAGA ACCAGCTTGG AATTTGTTAG CAGAAAATTT GACTGAAACT
ATGCGAAATC GCTTTGTTTA TTCCAGTGGA AAGGTGACAG CAAGAGGTTT AGGTGTGTTG
AAAGCCGAAC AACAATTGCA AACTTTAATA GATGCTTTCG GAAAAATGCA AGGTGCTATT
CCTGAAGCAG TTTTAGTTTG A
 
Protein sequence
MAFSSIVRAL ARSPLTAELI TKLKKYQELR LNGISRLPKG LVASALANNE GRDLCVVCAT 
LEEAGRVYAQ MEAMGWKTVH FYPAAEASPY EPFDPETELS WGQMQVLADL VNGQWSVVSS
QLPNRNTAII ATVGALQPHL PPLEVFRSFC LSLKKGLEYD LDEFSEKITS LGYERVPLVE
TEGQWSRRGD IVDVFPVSSE LPVRLEWFGD EIKQIREFDP ATQRSALDKV EQIFLTPTSF
SGIVLAALKQ SSEFRVLSAD LNSDVDDLEN LGLEGSRRFL GLAFAKPASL LDYLSANTLI
AIDEVEQCHA HSARWVENAD SQWRLGTGEE SQQVPKIHRT FNECLDEAGN FQKLYLSELA
EENSGTNLAS RPLPVTPHQF AKLAEKIRQE RDRKFAVWLI SAQPSRSVSL LQEHDCPAQF
IPNPRDYQAI GKLQINHTPI ALKYSGLAEL EGFILPSFRL VIVTDREFYG QHSLADFGYV
RKRRKAISKQ VDTNKLRPGD FVVHRSHGIG KFVELESLTI NNETRDYLVI QYADGLLKVA
ADKVGSLSRF RTSGDQTPAL HKMTGKAWDN TKNKVRKAIK KLAVDLLKLY AARSQQQGFA
YPADMPWQEE MEDSFPYQAT TDQLKAVQDV KRDMESERPM DRLVCGDVGF GKTEVAIRAI
FKAVTAGKQV AVLAPTTILT QQHYHTIKER FSPYPVNVGL LNRFRSAEEK RNIQKRLATG
ELDIVVGTHQ LLGKGVQFRD LGLLVIDEEQ RFGVNQKEKI KSLKTQVDVL TLSATPIPRT
LYMSLSGIRE MSLITTPPPT RRPIQTHLAP LNPEIVRSAI RQELDRGGQV FYVVPRVEGI
EETTANLREM IPGGRFAIAH GQMEESELES TMLTFGNNDA DILVCTTIIE SGLDIPRVNT
ILIEDAHRFG LAQLYQLRGR VGRAGIQAHA WLFYHKQREL SDAARLRLRA IQEFTQLGSG
YQLAMRDMEI RGVGNLLGAE QSGQMDAIGF DLYMEMLEEA IREIRGQEIP KVDDTQIDLN
LTAFIPSTYI TDIDQKMSAY RAVATAKSKE ELKSIAAEWT DRYGTIPVPA NQLLRVMELK
QLARNIGFSR IKPENKQHIV LETPMEEPAW NLLAENLTET MRNRFVYSSG KVTARGLGVL
KAEQQLQTLI DAFGKMQGAI PEAVLV