Gene Aazo_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3787 
Symbol 
ID9341592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3845838 
End bp3847508 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content40% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003722445 
Protein GI298492268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGAAA GTAAGTCTAA ATTTTTAGTC CCTGTTATTA GCACTGCTAT AGTCGTCGCA 
GGTGGGATAG CTGCTTATGT ATATTTTAAA GTGCCCTCTG AAGATGTTTC CAGTCCTCTG
GGAATTGCTA AAGTAGTACC GGCTAATGCC TTGATGGCGA CTTATATTAA CACAGATTCC
CAATCTTGGA GTAAGTTACA GCAGTTTGGA ACTCCACAAG CACAACAACT AGTATCCAAA
GGTCTACAGG ATATCAACAA ACAACTATTA AGTGATAGCA ATATTGTTTA TGAAACAGAC
ATAAAACCTT GGATTGGTGG AGTCATGATT GCTGTGCTAC CACCAAATTC TACTATACGT
AATCCACCAA ATCCACCAAT TCCAGTACAG CCAGAGCCAA ATATTTTGTT GGTAGTAGGT
ATAAAAGATA AACTCAATGC CTTGAAATTT GCTACTAAAT TGAAGGAGCA AAAAAACTTA
CAAATTCAAG AATCAGAATA CAAAGGTGAG AAAATTATTG CTAGTACAAG CAAGACTAAA
TCGACTTACA TGGTTGTTTT GAATAACACT CGTATACTGT TGACACCAGA AAAACAAGCT
GTAGAAAAAG CTATTGATAC CTATAAAGGT AAGCCATCCT TTGCCAACAA AGAAGGCGCA
AGTAGTATTT TAGCTAAAGG TGTAGATGTT CAAAACAGCC TTGCTCAAAT TTATGTGCCT
GATTACGCCA ATATGGCACA ACAGTTAACA GCTTTCAATC CACAGTCCAG GCCATTACCC
CCAGAAACAT TCGCACAACT CAAGCAAGTA AAATCAATGG TAGCGGCTGT GGGTGTCGAT
GATGCTGGAG TGAGAATGAA AGTAGTAGTG AACTTAGATC CGCAACTGAA CAAATTTCAA
TATCAAAATA CTCCGGCTAA GATAGTGGCA CAATTTCCCA GTGATACTTT TGCTTTAGTC
ACCGGACAGA ACATAAATCG TAGCTGGCAA ACCTTCCTGG AACAGTCAAA AGATTATCCT
GAAATTAAGC AAGGTGTGGA ACAAGCACGA GGACAACTAA AACAAGCGGT CAATCTGGAT
TTAGATAAAG AAATTTTTGG TTGGATGGAT CAAGAATTTG CCTTGGGTGC GGTGAAATCT
AGTCAAGGTT GGTTAGCCAA TGTTGGTTTT GGGGGAGCGA TGGTATTTGA CACCAGTGAT
CGCAAAACAG CGGAAGCCAC CTTCACTAAA CTAGATGACC TAGCCAAAAA GCAATCACTC
AACATCACCA AAAGAAGCAT TGGTGGTAAA AATATCACCG AATGGCAAAT TACCCAACAA
GGCACTTTCA TAGCACATGG TTGGCTAGAT CAGGATACCG TATTTCTCGC TATTGGTGGA
CCAGTTGGTG AAGCGCTAGC AGACAAAAAA GGTCAACCCC TGGATAATAC GAACACATTT
AAAGCTGTAA CGAGTTCCTT GCAAAAACCC AACGGTGGTT ATTTATACTT GGATTTAGAA
AACACCTCTT CTTTAATTAC CCGTTTAGCC ACACAAGGTA AACCTCTTCC CCTGGAAACC
AATGCTGTCC TATCATCCAT TCGTGGTTTG GGTGTGACAG TGAATAGCCC CGATAAATCC
ACCAGTCAAA TGGAAATGTT GTTAGCTCTT AAACCAAGTA GTAGTAAATA A
 
Protein sequence
MPESKSKFLV PVISTAIVVA GGIAAYVYFK VPSEDVSSPL GIAKVVPANA LMATYINTDS 
QSWSKLQQFG TPQAQQLVSK GLQDINKQLL SDSNIVYETD IKPWIGGVMI AVLPPNSTIR
NPPNPPIPVQ PEPNILLVVG IKDKLNALKF ATKLKEQKNL QIQESEYKGE KIIASTSKTK
STYMVVLNNT RILLTPEKQA VEKAIDTYKG KPSFANKEGA SSILAKGVDV QNSLAQIYVP
DYANMAQQLT AFNPQSRPLP PETFAQLKQV KSMVAAVGVD DAGVRMKVVV NLDPQLNKFQ
YQNTPAKIVA QFPSDTFALV TGQNINRSWQ TFLEQSKDYP EIKQGVEQAR GQLKQAVNLD
LDKEIFGWMD QEFALGAVKS SQGWLANVGF GGAMVFDTSD RKTAEATFTK LDDLAKKQSL
NITKRSIGGK NITEWQITQQ GTFIAHGWLD QDTVFLAIGG PVGEALADKK GQPLDNTNTF
KAVTSSLQKP NGGYLYLDLE NTSSLITRLA TQGKPLPLET NAVLSSIRGL GVTVNSPDKS
TSQMEMLLAL KPSSSK