Gene Information |
Locus tag | Aazo_0571 |
Symbol | |
ID | 9338358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 598054 |
End bp | 601338 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | carbamoyl-phosphate synthase large subunit |
Protein accession | YP_003720190 |
Protein GI | 298490013 |
COG category | |
COG ID | |
TIGRFAM ID | |
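The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon (the listed gene sequence ends in TAG). The constant and function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 598054
END_BP = 601338
REPORTED_GENE_LENGTH = 3285      # bp
REPORTED_PROTEIN_LENGTH = 1094   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the span includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length (bp):", computed_gene_length)
print("protein length (aa):", computed_protein_length)
```

Under those assumptions both computed values agree with the reported 3285 bp and 1094 aa.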
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGTC GTCAAGATAT CAGAAAAATA TTGCTGTTAG GTTCTGGTCC GATTGTGATT GGCCAAGCTT GTGAATTTGA CTATTCTGGT ACTCAAGCTT GTAAAGCTTT GCGGGAAGAG GGTTATGAGG TCGTGTTGGT TAATTCTAAC CCTGCTACCA TTATGACTGA CCCGGAAACT GCCGATCGCA CTTATATTGA ACCACTAACA CCGGAATTGG TAGCAAAGGT CATTGAAAAA GAACGTCCAG ATGCTTTGTT ACCAACAATG GGAGGACAAA CCGCCCTTAA TTTGGCTGTG GCTTTGTCGA AAAATGGGGT GTTGGATAAG TATAATGTGG AATTGATTGG GGCAAAATTA CCAGCAATTG AAAAAGCTGA AAATCGGAAG TTGTTTAATG AAGCGATGGG CAAGATTGGG GTTCCAGTTT GTCCTAGTGG TACAGCGTCT TCTTTGGAAG AATCTAAAGA GATCGCTCAT CATATCGGTA CTTATCCTCT CATTATTCGT CCCGCTTTTA CAATGGGTGG AACCGGTGGC GGTATCGCCT ATAATCAAGA AGAGTTTGAG CTGATGGCAC AGGTCGGTAT TGATGCTAGT CCTGTTTCTC AGATTCTCAT TGACCAATCT TTGCTAGGTT GGAAAGAGTA TGAACTAGAA GTAATGCGAG ATTTAGCAGA TAACGTGGTG ATTATCTGTT CAATCGAAAA TTTCGATCCT ATGGGCATTC ATACCGGCGA TTCTATCACA GTTGCACCTG CTCAAACTCT CACAGATAAG GAATATCAAC GTCTACGAGA TATGGCAATT AAAATTATCC GCGAAATTGG GGTAGAGACC GGCGGTTCTA ATATTCAGTT TGCGGTAAAT CCTGTGAACG GGGATGTGGT AGTTATTGAA ATGAACCCCC GTGTATCTCG TAGTTCTGCT TTAGCTTCCA AAGCCACTGG TTTTCCCATA GCGGGAATAG CCGCAAAGTT AGCTGTCGGT TATACCTTGG ATGAAATTAA AAATGACATC ACGAAACAAA CTCCTGCATC CTTTGAACCG ACTATAGATT ATGTGGTGAT AAAGATTCCC CGGTTTGCCT TTGAAAAATT CCCTGGTTCT GACTCGGTTC TGACTACACA AATGAAATCT GTCGGGGAAG CAATGGCTAT TGGCCGGACA TTTAATGAAT CTTTCCAAAA AGCCCTGCGT TCTTTAGAAA CGGGTCGTGC AGGTTGGGGT TGTGATAAGT CAGAAAAATT ACCTAGTGCG GAACAAATAC GCGCTCAATT ACGGACTCCC AACCCAGAAA GAGTATATGC GTTGCGTCAT GCGATGCAGT TGGGTATCAC TAATGAAGAG ATTTATGAAC TAACAGCCAT TGATCCTTGG TTTTTGGATA AATTACAGCA AATCTTGGAA GTTGAGAAGT TCCTCAAACG CACACCTTTA CAGCAGTTGA CAAAAGAGAA AATGTATGAA GTGAAGCGAA ATGGATTTAG CGATCGCCAA ATTGCCTATG CGACCAAAAC CAAGGAAGAT GAAGTGAGAG CATATCGGCA AAAACTAGGT ATTAAACCAG TTTACAAAAC TGTGGATACC TGCGCGGCTG AATTTGAAGC TTTCACACCT TATTACTATT CTACCTACGA AGAAGAAACG GAAGTATTAC CCACCGACAA GCCCAAGGTG ATGATTTTGG GAGGTGGTCC AAACCGTATT GGACAGGGAA TTGAATTTGA TTACTGTTGT TGTCATGCAG CTTATTCTCT GAAAGCTGCC GGTTATGAAA CCATCATGGT GAACTCTAAC 
CCAGAGACAG TTTCTACAGA TTACGATACC AGCGATCGCT TGTACTTTGA ACCTTTAACC AAAGAAGACG TTATCAACAT CATTGAAGCC GAGAACCCTG TCGGTATTAT TGTCCAGTTC GGTGGACAAA CACCATTAAA ATTAGCCATA CCATTACAGC AATATTTACA GGGAAGAAGT TGCCAGTCCC CAGTTTCCAG TTCCCAGTCC CCAGTCCCTC AGATTTGGGG TACATCTCCT GATTCTATCG ACATGGCAGA GAATCGGGAA CGGTTTGAAA ACATTTTGCA AGAGTTAAAT ATTGCTCAAC CGCCTAATGG TATTGCTAGA AGTTATGAAG ATGCATTAAT AGTTGCCAAA CGGATTGGGT ATCCTGTCGT AGTTCGTCCT AGCTATGTAT TAGGAGGAAG GGGGATGGAA ATCGTCTATT CTGATGCAGA GTTAGAAAGA TACATGACTT TTGCAGTACA GGTAGAACCA GAACACCCGA TTTTAATTGA TAAATTTTTA GAAAATGCCA TTGAAGTGGA TGTAGATGCG ATCGCCGATT ATACAGGTAA AGTCGTCATA GGCGGCATTA TGGAACACAT AGAACAGGCC GGAATTCACT CAGGAGACTC CGCTTGTTCC CTACCATCAA TCTCTCTTTC CCCAGCCGTA TTAAACCAAA TCCGCACCTG GACTGTGCAA CTAGCACAAG CCTTGTCCGT TGTGGGTTTA ATGAACATTC AATTTGCAGT CGTTGGTGCA AACGGTTACT CTCCCCAAGT TTACATCCTA GAAGCCAACC CTAGAGCATC CCGTACCGTC CCCTTTGTTT CCAAAGCCAC AGGTATCCCC TTAGCCAAAT TAGCATCCTT AATCATGTCG GGTAAAACCC TAGAAGAATT GAACTTTACC CAAGAAGTTA TTCCTTCTCA TATAGCCGTT AAAGAAGCTG TATTACCCTT TAATAAATTC CCCGGTACAG ATACTTTATT AGGACCGGAA ATGCGTTCCA CAGGGGAGGT CATGGGTATT GACGCTGACT TTGGCCGCGC TTTTGCAAAA GCAGAATTAG GTGCAGGGGA AAAACTCCCA CGTAAAGGAA GCGTATTTGT GTCTATGAGT GATAGAGATA AAGGTGCAGC CATAGAGGTA GTAAAAGAAT TTATCAGCTT AGGTTTTACC ATCATCGCTA CCCAAGGGAC ACGCCAAGTT CTACAGCAAA ACGGGGTAAA AGTTGACTTA ATCTTGAAAC TACATGAAGG GCGTCCCCAC GTCCTTGATG CTATCAAAAA TGAGAAAATC CAACTAATTA TTAATACGCC ATCAGGAGAG GAAGCACAAA CCGATGCGCG GTTAATCCGA CGTACTGGCC TAGCCTACAA AATCCCTATC ATTACTACCA TAGCTGGAGC TAGAGCAACA GTAGCAGCTA TCCGTTCTTT GCAAAATACG ACTTTGGATG TGAAGGTGAT TCAAGAATAT TGCCCAATGG GGTAG
|
Protein sequence | MPRRQDIRKI LLLGSGPIVI GQACEFDYSG TQACKALREE GYEVVLVNSN PATIMTDPET ADRTYIEPLT PELVAKVIEK ERPDALLPTM GGQTALNLAV ALSKNGVLDK YNVELIGAKL PAIEKAENRK LFNEAMGKIG VPVCPSGTAS SLEESKEIAH HIGTYPLIIR PAFTMGGTGG GIAYNQEEFE LMAQVGIDAS PVSQILIDQS LLGWKEYELE VMRDLADNVV IICSIENFDP MGIHTGDSIT VAPAQTLTDK EYQRLRDMAI KIIREIGVET GGSNIQFAVN PVNGDVVVIE MNPRVSRSSA LASKATGFPI AGIAAKLAVG YTLDEIKNDI TKQTPASFEP TIDYVVIKIP RFAFEKFPGS DSVLTTQMKS VGEAMAIGRT FNESFQKALR SLETGRAGWG CDKSEKLPSA EQIRAQLRTP NPERVYALRH AMQLGITNEE IYELTAIDPW FLDKLQQILE VEKFLKRTPL QQLTKEKMYE VKRNGFSDRQ IAYATKTKED EVRAYRQKLG IKPVYKTVDT CAAEFEAFTP YYYSTYEEET EVLPTDKPKV MILGGGPNRI GQGIEFDYCC CHAAYSLKAA GYETIMVNSN PETVSTDYDT SDRLYFEPLT KEDVINIIEA ENPVGIIVQF GGQTPLKLAI PLQQYLQGRS CQSPVSSSQS PVPQIWGTSP DSIDMAENRE RFENILQELN IAQPPNGIAR SYEDALIVAK RIGYPVVVRP SYVLGGRGME IVYSDAELER YMTFAVQVEP EHPILIDKFL ENAIEVDVDA IADYTGKVVI GGIMEHIEQA GIHSGDSACS LPSISLSPAV LNQIRTWTVQ LAQALSVVGL MNIQFAVVGA NGYSPQVYIL EANPRASRTV PFVSKATGIP LAKLASLIMS GKTLEELNFT QEVIPSHIAV KEAVLPFNKF PGTDTLLGPE MRSTGEVMGI DADFGRAFAK AELGAGEKLP RKGSVFVSMS DRDKGAAIEV VKEFISLGFT IIATQGTRQV LQQNGVKVDL ILKLHEGRPH VLDAIKNEKI QLIINTPSGE EAQTDARLIR RTGLAYKIPI ITTIAGARAT VAAIRSLQNT TLDVKVIQEY CPMG
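As a spot check that the protein sequence corresponds to the gene sequence, the first three 10-base blocks and the last 15 bases of the gene can be translated by hand. The sketch below is a minimal translator, not the tool that produced this record. It builds the codon table from the standard TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. The names `translate`, `FIRST_30` and `LAST_15` are illustrative.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of the genetic code.
# Table 11 (bacterial) assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces between blocks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and last 15 bases, copied from the Gene sequence field.
FIRST_30 = "ATGCCTCGTC GTCAAGATAT CAGAAAAATA"
LAST_15 = "TGCCCAATGG GGTAG"

print(translate(FIRST_30))  # start of the protein
print(translate(LAST_15))   # end of the protein plus the stop codon (*)
```

The first fragment gives the opening residues MPRRQDIRKI and the second gives CPMG followed by the stop codon, matching both ends of the listed protein sequence. The last 15 bases are in frame because the 3285 bp gene length is a multiple of 3.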
|
| |