Gene Aazo_5037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5037 
Symbol 
ID9342846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5156456 
End bp5158426 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content45% 
IMG OID 
Productacetate/CoA ligase 
Protein accessionYP_003723268 
Protein GI298493091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCAAC CGACTATAGA ATCTATCCTA CAGGAAAACC GACTTTTTCA TCCCTCTAGT 
AATTTTTCGC AACAGGCTAA TATCAAAAGT CTGGAAGAAT ATCAGCGGAT TTATGATCAA
GCTAAGGCTG ATCCACAGGC TTTTTGGGCA AAATTAGCGG AAACGGAGTT AGATTGGTTT
CAAAAATGGG ATATTGTGCT AGATTGGCAA CCTCCTTTTG CGAAGTGGTT TGTGGGTGGT
AAGATTAATA TCTCTTACAA TTGTCTTGAC AGACATCTGA CTACTTGGCG GAAAAATAAG
GCTGCTTTGA TTTGGGAAGG TGAACCGGGA GATTCCCGCA CGCTAACATA CTCCCAATTG
CATCGAGAAG TTTGCCAGTT TGCCAATGTA CTGAAACAGT TGGGGTTCAA AAAAGGTGAT
CGCATTGGTA TTTATATGCC GATGATTCCC GAAGCTGCTA TTGCGATGTT AGCCTGTGCG
AGAATTGGCG CACCCCATAG CGTTGTCTTT GGTGGGTTTA GTGCGGAGGC TTTGCGCGAT
CGCCTTAACG ATGCTGAGGC TAAATTAGTA GTAACAGCAG ATGGTGGTTG GCGTAAAGAT
GCGATCGTTC CCCTGAAAGA ACAGGTAGAT AAAGCCTTAG CTGATAACGC AGTTCCCAGC
GTCACAGATG TGCTGGTGGT AAAACGCACA GGTCAAAAAA CCCAGATGGA ACCAGGACGG
GATCACTGGT GGCATGATTT ACAAAAAGGT GTCTCCGCAG ATTGTCCCGC CGAACCAATG
GACAGCGAAG ATATGCTGTT TGTCCTTTAT ACTTCTGGCA GTACTGGTAA ACCCAAGGGT
GTTGTCCATA CAACTGGTGG TTATAACTTA TACAGCCATA TTACCACAAA ATGGATTTTT
GACCTCCAGG ACACAGATGT ATATTGGTCT ACTGCTGATG TAGGTTGGAT TACAGGACAT
AGCTATATTG TTTATGGACC CCTTTCCAAT GGTACAACCA CTATTATGTA TGAAGGTGCG
CCCCGTGGTT CTAATCCTGG TTGCTTCTGG GATATAATTG AAAAATACGG CATAACTATC
TTTTATACCG CACCGACAGC CATCCGCGCC TTTATTAAGA TGGGTGAACA CCATCCGAGA
AAACGCAATC TTTCTTCCTT ACGTTTACTG GGAAGTGTCG GTGAACCTAT TAACCCAGAA
GCTTGGATGT GGTATCACAA AATCATTGGT GGTGAACGCT GCCCTATTGT TGATACTTGG
TGGCAAACGG AAACTGGTGG TATTATGATT ACACCCTTAC CTGGTGCAAT TCCCACTAAA
CCAGGTTCAG CGACTTTGCC TTTCCCTGGG ATTATTGCAG ATATCGTGGA TTTAGAAGGT
AATTCTGTCC CAGAAAATGA AGGTGGTTAT TTAGCGGTTC GTCATCCTTG GCCGGGAATG
ATGCGGACTG TTTACGGTGA TCCTGATCGC TTTCGTCGGA CTTATTGGGA ACATATTCCC
CCCAAAGATG GTAACTATAC GTACTTTGCT GGTGATGGTG CAAGAAAGGA TGAGCATGGC
TATTTCTGGG TGATGGGGCG TGTGGATGAT GTGCTGAATG TCTCTGGACA CCGCCTGGGA
ACGATGGAAT TAGAATCTGC GTTGGTATCT CATCCAGCGG TGGCTGAGGC TGCGGTGGTA
GGTAAACCTG ATGAGTTAAA GGGTGAGATA GTTATAGCTT TTGTGACCTT AGAGGGTACT
TATCAAGCCA GTGAGGAGTT GAGTAAGGAA CTGAAGAAGC ACGTTGTTCA AGAAATTGGT
GCGATCGCAC GTCCTGGTGA AATTAGGTTT ACTGATGCCT TACCGAAAAC CCGTTCTGGT
AAAATTATGC GCCGTTTATT GCGGAATTTA GCCGCAGGTC AGCAGGTATC GGGGGATACT
TCGACTTTGG AAGACCGCAG TGTTTTGGAT AAGTTGCGGG AAGGTGCATA A
 
Protein sequence
MFQPTIESIL QENRLFHPSS NFSQQANIKS LEEYQRIYDQ AKADPQAFWA KLAETELDWF 
QKWDIVLDWQ PPFAKWFVGG KINISYNCLD RHLTTWRKNK AALIWEGEPG DSRTLTYSQL
HREVCQFANV LKQLGFKKGD RIGIYMPMIP EAAIAMLACA RIGAPHSVVF GGFSAEALRD
RLNDAEAKLV VTADGGWRKD AIVPLKEQVD KALADNAVPS VTDVLVVKRT GQKTQMEPGR
DHWWHDLQKG VSADCPAEPM DSEDMLFVLY TSGSTGKPKG VVHTTGGYNL YSHITTKWIF
DLQDTDVYWS TADVGWITGH SYIVYGPLSN GTTTIMYEGA PRGSNPGCFW DIIEKYGITI
FYTAPTAIRA FIKMGEHHPR KRNLSSLRLL GSVGEPINPE AWMWYHKIIG GERCPIVDTW
WQTETGGIMI TPLPGAIPTK PGSATLPFPG IIADIVDLEG NSVPENEGGY LAVRHPWPGM
MRTVYGDPDR FRRTYWEHIP PKDGNYTYFA GDGARKDEHG YFWVMGRVDD VLNVSGHRLG
TMELESALVS HPAVAEAAVV GKPDELKGEI VIAFVTLEGT YQASEELSKE LKKHVVQEIG
AIARPGEIRF TDALPKTRSG KIMRRLLRNL AAGQQVSGDT STLEDRSVLD KLREGA