Gene Aazo_1442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1442 
Symbol 
ID9339236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1511357 
End bp1513342 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content42% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003720792 
Protein GI298490615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA TACAACAACC TACTGCTTCT TTTTTATCTA ACGTCACCGA GCAAGAATAC 
CGGGAATTAC AGCGTTTGGT AGACTATACC AATGTAGATT TGCTCCCAGA AATTTGGCCT
TTAGCTGCCA AAAAGTTTGG TAATACTGTT GCCCTCCATA ACCCCCACGC TAAACCAGAA
GTTAAGATTA CTTATAGTCA GTTAGCAGAT CAAATCCAAA GATTTGCAGT AGGTTTGCAG
TCATTAGGAA TGAATATGGG TGACAGTGAA ACACCCAACA TTGGAGAACG CATTTCCCTA
ATTGCTGATA ACAGTCCGGG CTGGTTTATT GCTGATCAAG GTATCATGAC TGCTGGGGCT
GTGAACGCAG TGCGTAGCGC CCAAGCCGAA CGGGAAGAAC TATTATTTAT TATTGCTAAT
AGCGGTAGTA CTGTGCTGGT GGTCGAGGAT ATCAAAACAT TCCAGAAACT AGAGAAGGGT
CTGAAGGACT TACCTATTAA ACTGGTCATT CTTCTTTCCG ATGAAACACC ACCAACAGCA
GAAAATTTGG AACTGGTGAA CTTTTCCCAG TTACTAGAAA TTGGCAGCAA CCACACCTTA
GCACCGATGA AACAAAGCCG TGATAGCTTG GCAACCTTAA TTTATACATC TGGTACTACT
GCTAAGCCAA AAGGTGTAAT GCTTTCCCAT AGCAACTTGC TGCACCAAGT TACAACCTTG
GGAACAGTAG TGCAACCGGA ATCTGGAGAT ATAGTTCTGA GTATTTTACC TACTTGGCAC
AGTTATGAAC GGAGTGGAGA GTATTTCTTG CTTTCTCAAG GTTGCACTCA AGTTTACACA
AATTTACGCT CTGTCAAAGA TGATCTAAAA AACTTTAAGC CTAATTATAT TATTGCTGTA
CCAAGATTCT GGGAATCAAT ATATGAAGGA GTGCAAAAGC AGTTTCGTTC CCAGCCTGCG
AAAAAACAAC AGTTGATTAA ATTTTTATTG GATATGAGCC AGAAATATAT CCAAGCGAGG
AGGATTGCTG AAGGATTGAG TTTACATCAT GTTAATCCCT CAGCTGTTGA GCGATTAGGA
GCAAAAATAC TAGAATTAGC TCTGTTGCCA TTCCAAACAC TGGGAGAAAA ATTAGTTTAT
GCCAAAGTAC GGGAAGCTAC AGGTGACAAA ATCAAGCAGG TAATTAGTGG TGGTGGTGCG
CTTCCCCAAC ATATAGATAA CTTTTTTGAA ATAATTGGTG TAGAAATTTT ACAGGGCTAT
GGCTTGACGG AAACCTCACC AGTAACAAAT GCCCGTCGTC CTTGGCGAAA TTTGCGAGGA
TCATCTGGGC AACCAATTCC GGGAACAGAA GTTAAGATAG TTAGTCCTGA GACTCGTCAG
CCACTACCAG CAGGAGAACG TGGTTTGGTG TTGCTCAGAG GGCCACAAAT TATGCAGGGC
TATTATCAAA ATCCGGAAGC GACAAAAAAA GTCATAGATG CTGAAGGTTG GTTTGATAGT
GGTGATTTAG GCTGGGTGAC ACCCCAAAAC GACTTGGTGC TAACTGGTAG GGCAAAGGAT
ACGATTGTTT TAACCAATGG GGAGAATATC GAACCCCAAC CTATAGAAGA TGCTTGTTTG
CGATCGCCCT ACATTGATCA AATCATGTTA GTGGGACAAG ACCAGCGCAG CCTTGGGGCG
TTAATTGTTC CCAATCTCGA AGCCTTGGAA AAATCGGCAG CAAATCAGAA TGATAATATT
ACTGCCTCCA GCGGTCAAAA AATTGACTTA GAGAGTAAAA TGATCCAGGA TTTGTTTCGG
CAAGAATTGA ATCGGGAAGT AAAAAACCGT CCCGGTTATC GAGCCGATGA CCGGATTGGC
CCATTCCAAT TGATTATCGA ACCCTTTTCC ATTGAAAATG GCATGATGAC ACAAACTCTA
AAAATCCGTC GTCACGTCGT CACGGACGAG TATCACGATA TTATTGACCG AATGTTTGCC
AAATAA
 
Protein sequence
MTKIQQPTAS FLSNVTEQEY RELQRLVDYT NVDLLPEIWP LAAKKFGNTV ALHNPHAKPE 
VKITYSQLAD QIQRFAVGLQ SLGMNMGDSE TPNIGERISL IADNSPGWFI ADQGIMTAGA
VNAVRSAQAE REELLFIIAN SGSTVLVVED IKTFQKLEKG LKDLPIKLVI LLSDETPPTA
ENLELVNFSQ LLEIGSNHTL APMKQSRDSL ATLIYTSGTT AKPKGVMLSH SNLLHQVTTL
GTVVQPESGD IVLSILPTWH SYERSGEYFL LSQGCTQVYT NLRSVKDDLK NFKPNYIIAV
PRFWESIYEG VQKQFRSQPA KKQQLIKFLL DMSQKYIQAR RIAEGLSLHH VNPSAVERLG
AKILELALLP FQTLGEKLVY AKVREATGDK IKQVISGGGA LPQHIDNFFE IIGVEILQGY
GLTETSPVTN ARRPWRNLRG SSGQPIPGTE VKIVSPETRQ PLPAGERGLV LLRGPQIMQG
YYQNPEATKK VIDAEGWFDS GDLGWVTPQN DLVLTGRAKD TIVLTNGENI EPQPIEDACL
RSPYIDQIML VGQDQRSLGA LIVPNLEALE KSAANQNDNI TASSGQKIDL ESKMIQDLFR
QELNREVKNR PGYRADDRIG PFQLIIEPFS IENGMMTQTL KIRRHVVTDE YHDIIDRMFA
K