Gene Aazo_4767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4767 
Symbol 
ID9342574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4866955 
End bp4868271 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content43% 
IMG OID 
Productgid protein 
Protein accessionYP_003723068 
Protein GI298492891 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAC AACCGATACA AGTAATTGGC GGTGGACTAG CGGGAACAGA AGCGGCGTGG 
CAAATTGCCC AAGCTGGCAT TCCTGTAATT CTCCACGAGA TGCGTCCTAA ACGCTTCAGC
CCTGCCCATC ATACCGAAAA TTTGGCAGAA TTGGTATGTA GTAACTCTTT CGGGTCCATG
GCGAGCGATC GCGCGGCAGG ATTATTACAC GAAGAATTAC GTCAACTCGG TTCTATTGTT
ATTGCTAAAG CTGACGAACA CGCAGTCCCC GCTGGTGGTG CATTAGCAGT AGACAGAGCA
CAATTTGGGG AAGATTTAAC CCAAACGTTA GCAAATCATC CTTTAATTGA TTTCCGACGG
GGAGAAGTGA CAGCAATTCC CGAAGGTATT GTAGCTTTGG CAAGTGGTCC TTTAACCAGT
CCCGATTTAT CCGCAGATTT ACAACGATTT ACCGGGATGG AATATCTCAA TTTCTTTGAT
GCTGCCAGTC CTATTATTGT TGGAGATTCT ATTAATAAAG ATGTTGCATT TATGGCTTCC
CGTTATGACA AAGGTGAAGG AGCTTATCTT AATTGTCCCA TGAATAAAGA GCAGTATTTA
CATTTTTGGG AGGAATTACG TAAAGCCGAA CAAACAGAAT TAAAAGACTT TGAAAAGGAA
ACAGCAAAAT TTTTTGAAGC TTGTTTACCG ATTGAAGAAA TGGCACGACG GGGGGAAGAC
ACCATGCGTT ATGGACCTCT CAAACCGGTG GGTTTATCGG ATAGTCGCAC AGGAGAAAGT
CCTTATGCGG TAATTCAATT AAGACAAGAA GATAAAGCCC ATCAACTTTG GAATATGGTA
GGATTCCAAA CTAATCTGCG GTGGGGTGAA CAAAAGCGCG TATTCCAAAT GATTCCTGGT
TTGGAAAAAG CCGAATTTGT CAGATTAGGA GTCATGCACC GCAATACCTT TTTAAATGCA
CCACAGTTAA TGTCTGCAAG TTTGCAATTT AAAGAACGTC CAACTTTATT AGCTGCGGGA
CAATTAATAG GAACAGAAGG TTATACTGCT GCATCTGCGG GTGGTTGGTT AGCGGGAACA
AATGCAGCGC GGTTAGCTTT GGGTAAAGAA CCTCTAATTC TGCCTGTAAC AACGATGATG
GGGGCTTTGT TTGAGTTTAT CAGATCCGCT TCACCTAAGC ATTTTCAACC GATGGCTCCT
AATTTCGGCA TTTTGCCAGA TTTGGGAGTG AAAATCAAGA GTAAACCGGA AAAATATGGA
CGTTATCGCG ATCGCGCTTT GGCAGATTTA GCAAATTGGA AAGTTAACCA CTTATAA
 
Protein sequence
MTQQPIQVIG GGLAGTEAAW QIAQAGIPVI LHEMRPKRFS PAHHTENLAE LVCSNSFGSM 
ASDRAAGLLH EELRQLGSIV IAKADEHAVP AGGALAVDRA QFGEDLTQTL ANHPLIDFRR
GEVTAIPEGI VALASGPLTS PDLSADLQRF TGMEYLNFFD AASPIIVGDS INKDVAFMAS
RYDKGEGAYL NCPMNKEQYL HFWEELRKAE QTELKDFEKE TAKFFEACLP IEEMARRGED
TMRYGPLKPV GLSDSRTGES PYAVIQLRQE DKAHQLWNMV GFQTNLRWGE QKRVFQMIPG
LEKAEFVRLG VMHRNTFLNA PQLMSASLQF KERPTLLAAG QLIGTEGYTA ASAGGWLAGT
NAARLALGKE PLILPVTTMM GALFEFIRSA SPKHFQPMAP NFGILPDLGV KIKSKPEKYG
RYRDRALADL ANWKVNHL