Gene Aazo_0571 details

Gene Information

Locus tag: Aazo_0571
Symbol:
ID: 9338358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 598054
End bp: 601338
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: carbamoyl-phosphate synthase large subunit
Protein accession: YP_003720190
Protein GI: 298490013
COG category:
COG ID:
TIGRFAM ID:
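
The start, end, gene length and protein length values in the table above are consistent with each other. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not. The function names are hypothetical and are not part of the database.

```python
# Consistency check for the coordinates listed in the Gene Information table.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the stop codon is counted in the gene length only.

START_BP = 598054
END_BP = 601338


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                    # 3285 bp
    print(protein_length(length), "aa")    # 1094 aa
```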


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCGTC GTCAAGATAT CAGAAAAATA TTGCTGTTAG GTTCTGGTCC GATTGTGATT 
GGCCAAGCTT GTGAATTTGA CTATTCTGGT ACTCAAGCTT GTAAAGCTTT GCGGGAAGAG
GGTTATGAGG TCGTGTTGGT TAATTCTAAC CCTGCTACCA TTATGACTGA CCCGGAAACT
GCCGATCGCA CTTATATTGA ACCACTAACA CCGGAATTGG TAGCAAAGGT CATTGAAAAA
GAACGTCCAG ATGCTTTGTT ACCAACAATG GGAGGACAAA CCGCCCTTAA TTTGGCTGTG
GCTTTGTCGA AAAATGGGGT GTTGGATAAG TATAATGTGG AATTGATTGG GGCAAAATTA
CCAGCAATTG AAAAAGCTGA AAATCGGAAG TTGTTTAATG AAGCGATGGG CAAGATTGGG
GTTCCAGTTT GTCCTAGTGG TACAGCGTCT TCTTTGGAAG AATCTAAAGA GATCGCTCAT
CATATCGGTA CTTATCCTCT CATTATTCGT CCCGCTTTTA CAATGGGTGG AACCGGTGGC
GGTATCGCCT ATAATCAAGA AGAGTTTGAG CTGATGGCAC AGGTCGGTAT TGATGCTAGT
CCTGTTTCTC AGATTCTCAT TGACCAATCT TTGCTAGGTT GGAAAGAGTA TGAACTAGAA
GTAATGCGAG ATTTAGCAGA TAACGTGGTG ATTATCTGTT CAATCGAAAA TTTCGATCCT
ATGGGCATTC ATACCGGCGA TTCTATCACA GTTGCACCTG CTCAAACTCT CACAGATAAG
GAATATCAAC GTCTACGAGA TATGGCAATT AAAATTATCC GCGAAATTGG GGTAGAGACC
GGCGGTTCTA ATATTCAGTT TGCGGTAAAT CCTGTGAACG GGGATGTGGT AGTTATTGAA
ATGAACCCCC GTGTATCTCG TAGTTCTGCT TTAGCTTCCA AAGCCACTGG TTTTCCCATA
GCGGGAATAG CCGCAAAGTT AGCTGTCGGT TATACCTTGG ATGAAATTAA AAATGACATC
ACGAAACAAA CTCCTGCATC CTTTGAACCG ACTATAGATT ATGTGGTGAT AAAGATTCCC
CGGTTTGCCT TTGAAAAATT CCCTGGTTCT GACTCGGTTC TGACTACACA AATGAAATCT
GTCGGGGAAG CAATGGCTAT TGGCCGGACA TTTAATGAAT CTTTCCAAAA AGCCCTGCGT
TCTTTAGAAA CGGGTCGTGC AGGTTGGGGT TGTGATAAGT CAGAAAAATT ACCTAGTGCG
GAACAAATAC GCGCTCAATT ACGGACTCCC AACCCAGAAA GAGTATATGC GTTGCGTCAT
GCGATGCAGT TGGGTATCAC TAATGAAGAG ATTTATGAAC TAACAGCCAT TGATCCTTGG
TTTTTGGATA AATTACAGCA AATCTTGGAA GTTGAGAAGT TCCTCAAACG CACACCTTTA
CAGCAGTTGA CAAAAGAGAA AATGTATGAA GTGAAGCGAA ATGGATTTAG CGATCGCCAA
ATTGCCTATG CGACCAAAAC CAAGGAAGAT GAAGTGAGAG CATATCGGCA AAAACTAGGT
ATTAAACCAG TTTACAAAAC TGTGGATACC TGCGCGGCTG AATTTGAAGC TTTCACACCT
TATTACTATT CTACCTACGA AGAAGAAACG GAAGTATTAC CCACCGACAA GCCCAAGGTG
ATGATTTTGG GAGGTGGTCC AAACCGTATT GGACAGGGAA TTGAATTTGA TTACTGTTGT
TGTCATGCAG CTTATTCTCT GAAAGCTGCC GGTTATGAAA CCATCATGGT GAACTCTAAC
CCAGAGACAG TTTCTACAGA TTACGATACC AGCGATCGCT TGTACTTTGA ACCTTTAACC
AAAGAAGACG TTATCAACAT CATTGAAGCC GAGAACCCTG TCGGTATTAT TGTCCAGTTC
GGTGGACAAA CACCATTAAA ATTAGCCATA CCATTACAGC AATATTTACA GGGAAGAAGT
TGCCAGTCCC CAGTTTCCAG TTCCCAGTCC CCAGTCCCTC AGATTTGGGG TACATCTCCT
GATTCTATCG ACATGGCAGA GAATCGGGAA CGGTTTGAAA ACATTTTGCA AGAGTTAAAT
ATTGCTCAAC CGCCTAATGG TATTGCTAGA AGTTATGAAG ATGCATTAAT AGTTGCCAAA
CGGATTGGGT ATCCTGTCGT AGTTCGTCCT AGCTATGTAT TAGGAGGAAG GGGGATGGAA
ATCGTCTATT CTGATGCAGA GTTAGAAAGA TACATGACTT TTGCAGTACA GGTAGAACCA
GAACACCCGA TTTTAATTGA TAAATTTTTA GAAAATGCCA TTGAAGTGGA TGTAGATGCG
ATCGCCGATT ATACAGGTAA AGTCGTCATA GGCGGCATTA TGGAACACAT AGAACAGGCC
GGAATTCACT CAGGAGACTC CGCTTGTTCC CTACCATCAA TCTCTCTTTC CCCAGCCGTA
TTAAACCAAA TCCGCACCTG GACTGTGCAA CTAGCACAAG CCTTGTCCGT TGTGGGTTTA
ATGAACATTC AATTTGCAGT CGTTGGTGCA AACGGTTACT CTCCCCAAGT TTACATCCTA
GAAGCCAACC CTAGAGCATC CCGTACCGTC CCCTTTGTTT CCAAAGCCAC AGGTATCCCC
TTAGCCAAAT TAGCATCCTT AATCATGTCG GGTAAAACCC TAGAAGAATT GAACTTTACC
CAAGAAGTTA TTCCTTCTCA TATAGCCGTT AAAGAAGCTG TATTACCCTT TAATAAATTC
CCCGGTACAG ATACTTTATT AGGACCGGAA ATGCGTTCCA CAGGGGAGGT CATGGGTATT
GACGCTGACT TTGGCCGCGC TTTTGCAAAA GCAGAATTAG GTGCAGGGGA AAAACTCCCA
CGTAAAGGAA GCGTATTTGT GTCTATGAGT GATAGAGATA AAGGTGCAGC CATAGAGGTA
GTAAAAGAAT TTATCAGCTT AGGTTTTACC ATCATCGCTA CCCAAGGGAC ACGCCAAGTT
CTACAGCAAA ACGGGGTAAA AGTTGACTTA ATCTTGAAAC TACATGAAGG GCGTCCCCAC
GTCCTTGATG CTATCAAAAA TGAGAAAATC CAACTAATTA TTAATACGCC ATCAGGAGAG
GAAGCACAAA CCGATGCGCG GTTAATCCGA CGTACTGGCC TAGCCTACAA AATCCCTATC
ATTACTACCA TAGCTGGAGC TAGAGCAACA GTAGCAGCTA TCCGTTCTTT GCAAAATACG
ACTTTGGATG TGAAGGTGAT TCAAGAATAT TGCCCAATGG GGTAG
 
Protein sequence
MPRRQDIRKI LLLGSGPIVI GQACEFDYSG TQACKALREE GYEVVLVNSN PATIMTDPET 
ADRTYIEPLT PELVAKVIEK ERPDALLPTM GGQTALNLAV ALSKNGVLDK YNVELIGAKL
PAIEKAENRK LFNEAMGKIG VPVCPSGTAS SLEESKEIAH HIGTYPLIIR PAFTMGGTGG
GIAYNQEEFE LMAQVGIDAS PVSQILIDQS LLGWKEYELE VMRDLADNVV IICSIENFDP
MGIHTGDSIT VAPAQTLTDK EYQRLRDMAI KIIREIGVET GGSNIQFAVN PVNGDVVVIE
MNPRVSRSSA LASKATGFPI AGIAAKLAVG YTLDEIKNDI TKQTPASFEP TIDYVVIKIP
RFAFEKFPGS DSVLTTQMKS VGEAMAIGRT FNESFQKALR SLETGRAGWG CDKSEKLPSA
EQIRAQLRTP NPERVYALRH AMQLGITNEE IYELTAIDPW FLDKLQQILE VEKFLKRTPL
QQLTKEKMYE VKRNGFSDRQ IAYATKTKED EVRAYRQKLG IKPVYKTVDT CAAEFEAFTP
YYYSTYEEET EVLPTDKPKV MILGGGPNRI GQGIEFDYCC CHAAYSLKAA GYETIMVNSN
PETVSTDYDT SDRLYFEPLT KEDVINIIEA ENPVGIIVQF GGQTPLKLAI PLQQYLQGRS
CQSPVSSSQS PVPQIWGTSP DSIDMAENRE RFENILQELN IAQPPNGIAR SYEDALIVAK
RIGYPVVVRP SYVLGGRGME IVYSDAELER YMTFAVQVEP EHPILIDKFL ENAIEVDVDA
IADYTGKVVI GGIMEHIEQA GIHSGDSACS LPSISLSPAV LNQIRTWTVQ LAQALSVVGL
MNIQFAVVGA NGYSPQVYIL EANPRASRTV PFVSKATGIP LAKLASLIMS GKTLEELNFT
QEVIPSHIAV KEAVLPFNKF PGTDTLLGPE MRSTGEVMGI DADFGRAFAK AELGAGEKLP
RKGSVFVSMS DRDKGAAIEV VKEFISLGFT IIATQGTRQV LQQNGVKVDL ILKLHEGRPH
VLDAIKNEKI QLIINTPSGE EAQTDARLIR RTGLAYKIPI ITTIAGARAT VAAIRSLQNT
TLDVKVIQEY CPMG
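
The protein sequence can be rederived from the gene sequence with the genetic code named in the record (translation table 11). The sketch below is one way to do that, not the database's own method. It assumes the sequence is pasted in the 10-character blocks shown above, and it relies on the fact that table 11 assigns codons to amino acids in the same way as the standard code (the two differ only in which codons may act as starts). The names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop, 'X' an unknown codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First and last lines of the gene sequence as displayed in the record.
    first_line = "ATGCCTCGTC GTCAAGATAT CAGAAAAATA TTGCTGTTAG GTTCTGGTCC GATTGTGATT"
    last_codons = "TGCCCAATGG GGTAG"
    print(translate(first_line))   # MPRRQDIRKILLLGSGPIVI
    print(translate(last_codons))  # CPMG*
```

The two outputs match the first 20 and the last 4 residues of the protein sequence listed above, with the final `*` corresponding to the TAG stop codon that closes the gene sequence.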