Gene Aazo_2644 details


Gene Information

Locus tag: Aazo_2644
Symbol:
ID: 9340443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2731634
End bp: 2733223
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: XRE family transcriptional regulator
Protein accession: YP_003721651
Protein GI: 298491474
COG category:
COG ID:
TIGRFAM ID:
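
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2731634, 2733223)
print(length_bp)                  # 1590
print(protein_length(length_bp))  # 529
```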


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.126523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTATA CAATTCCTAA CAAAAATTGC GATGGATGTG ATAACTGCCG CCCCCAATGT 
CCTACGGGTG CAATCAAAAT TGAAAATGAT AAATATTGGA TTGATCCTTG TCTGTGTAAC
AATTGTGAGG GTTATTATCC AGAACCGCAA TGTGTAATTG CCTGTCCAAG CAATTCTCCC
ATACCTTGGC AAGCTAAAAA AGGCAGATGC AAAGTTGAAG CGCGAGAACC TACCAGTCCT
GATTTATTTT CTAATGGCAA GAATAACCCT TTTGCTTCGG CAATAGTTAT TTGGGAAGCT
TGCAATCTAC TAGCGCAACG TAAATCATTA AATTGGGAGA CAGACGCAGA AGGTAATTTA
CATTATAGCC GACAAGTTAA TCAAGGTCGG GGTGCAATTT CCTTTCACAT CCAAGACCCA
TTTCAAGTTA GCAACCTGGC TAAAGATTTA CAAGCAATCG AAAGTCTCGA TATTCGGGCT
GCTTGTATTC ATCTGATTTT TGCCGCCCAT GCTACAACCT TAGAACAACC TTGGGATCAG
GAATTTGCGA TTGATGAACG CCAAATTGAA AAATATTTGG GGTTAGAAAA ACGCAAAGAC
CTCAATAAAG TTGCCAAGCT ATGTCTAATC AAAAACATCG TCCAGCAAGG TTGTTCACTG
ATTGTTTCTA TTGACTGGCC TCAAAGGGGT AGAGTTCCCG GATTTTCTGT TTTAGATAGT
CGTTTGTGGC ATTTAACAGA TATACAGCAC CATTTTCAAG AAGATGATCA GGGGTGCAAA
TATCTGATTG GCCTGACATT TAAAATTAAA GCTGGCATCT GGGCGCAACA TTTCTTAAAT
AAACAAGGCT GTAAAGAACG CAGTCGATTC TATCAATATG GTAGTCTTCC AAAAACGCTG
TTAACTACAG TTATGAGTCT TTGGCAGCAA CATGAAGGTG CTGTCAGACT GATGTTATGG
TTATTATTTA AAACCAAAAT GGGTAGAGAA CAACGCATCA CAGTTCCTAC CTTATTACGT
GTTGCTTATG GTGAGGAAAA AATTCACCTA GCCGCTAGAC AAAGAGATGA ACGTAAGCGC
CTCTTGCGAA CATTTGAAAA TGATTTGGAA GTTCTCAATC ATTATGGAGT CAAAGCAATT
TTTGACCCAA TTACTTACCC GCGAGAAATT CAACCTTTGT GGGCTAAATT AGTCGATATT
CCCGAAGATC CACATGAAGC CTTGGAATTT TGGATTAATG ATGGTAGTGG TCATACTCGA
CTTACAGATA GCGGACCTCG TGGTAAATGG AAGTTGCTGA TGAATGCGCG GATTTCATCT
TTTGAACTTC CTCCAGAATG GGAACAACAA AGTTCAGAAG CAGAGAAAAA AAAGCGGCGT
GCCATTAGGA GTAAAAAAAT CATCAAAAGC ACAGATGATT TATTAGCAGA ACAGGTTATA
CAAGCACGAA AAAGTATCGA TCTTTCCCAA AGAGAACTAG CAAAACTCAC TGGTAAAAGC
CAAAGCTGGA TTCGTGATGT GGAAAGTGGC CGTCTTAAAC CCAAATTAGA AGACCAAATA
TTATTGAGAA AGGTCTTGAA TATACTTTAA
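
The gene sequence is displayed in space-separated blocks, so the whitespace has to be stripped before measuring length or GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the record does not say how the database itself derives or rounds its GC figure.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGCCTTATA CAATTCCTAA CAAAAATTGC GATGGATGTG ATAACTGCCG CCCCCAATGT"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field in the record.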
 
Protein sequence
MPYTIPNKNC DGCDNCRPQC PTGAIKIEND KYWIDPCLCN NCEGYYPEPQ CVIACPSNSP 
IPWQAKKGRC KVEAREPTSP DLFSNGKNNP FASAIVIWEA CNLLAQRKSL NWETDAEGNL
HYSRQVNQGR GAISFHIQDP FQVSNLAKDL QAIESLDIRA ACIHLIFAAH ATTLEQPWDQ
EFAIDERQIE KYLGLEKRKD LNKVAKLCLI KNIVQQGCSL IVSIDWPQRG RVPGFSVLDS
RLWHLTDIQH HFQEDDQGCK YLIGLTFKIK AGIWAQHFLN KQGCKERSRF YQYGSLPKTL
LTTVMSLWQQ HEGAVRLMLW LLFKTKMGRE QRITVPTLLR VAYGEEKIHL AARQRDERKR
LLRTFENDLE VLNHYGVKAI FDPITYPREI QPLWAKLVDI PEDPHEALEF WINDGSGHTR
LTDSGPRGKW KLLMNARISS FELPPEWEQQ SSEAEKKKRR AIRSKKIIKS TDDLLAEQVI
QARKSIDLSQ RELAKLTGKS QSWIRDVESG RLKPKLEDQI LLRKVLNIL
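
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard code (the two differ in permitted start codons), and it assumes the reading frame starts at the first base. `translate` is an illustrative name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three and last three blocks of the gene sequence above.
print(translate("ATGCCTTATA CAATTCCTAA CAAAAATTGC"))  # MPYTIPNKNC
print(translate("TTATTGAGAA AGGTCTTGAA TATACTTTAA"))  # LLRKVLNIL
```

This sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits as initiators; this gene begins with ATG, so that case does not arise here.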