Gene Aazo_2972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2972 
Symbol 
ID9340776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3055446 
End bp3057917 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content42% 
IMG OID 
Productfilamentous hemagglutinin family outer membrane protein 
Protein accessionYP_003721898 
Protein GI298491721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAA CTTTTTCTAT CTTTGCTTTA ATTAGCGTCG TATTAACATC CGTCACTTAT 
AACAGTCGCG TTCAGGCAAA AGTGACTCCC GACAGCAAAC TCAAAACTAC TGTGACTGGC
AGTAATAATT ATACCATCAC CAATGGTAAC CGTGTCGGCA ATAATTTATT CCATAGCTTT
AGTGAATTCT CTATTCCTAC TAACGGATCT GCATTCTTCG ATAATGCTAA CGATATCCAG
AATATTTTTA GCCGCGTGAC TGGCGGCAAT CTTTCCAATA TTGATGGCTT GATTCAAGCC
AATGGTAGTG CCAATTTATT CTTACTCAAC CCATCTGGGA TTATTTTTGG TGCAAATGCC
CGCTTAGATA TTGGTGGTTC ATTTGTAGGA ACAACTGGCA ATAGTATTAA GTTTGCTGAT
GGCACAGAAT TTAGCGCAGT CAATGCGAGT AGTTCGCCAT TGTTAACCAT GAGCGTACCG
ATAGGCTTGC AAATGGGTCA AGATTCGGGA GCCATCACAG TCCAAGGGTT AGGGCATCGC
ATTATACCAC CATTCTCTGT AGCACAGGAA TTGGATCTCA GCAATAATCC CACTGGGTTA
CAAGTCAAGG CAGGTAATAC CCTGGCACTC ATTGGTAGTG GGCTAAATTT CGCGGGGGGC
ATTGTGGCAG CAGACGGAGG TGGACATATA GAAATAGGGA GTATCAATCA TGGGCTAGTC
AGACTCAATT CTACAGTGAC AGGATGGAAG GGAGATTACT CACAAGTAGA ACAGTTTAAT
GATATCCATC TAGCCCAACA ATCTCTACTA GATGCTAGTG GCAGCAATGG TTCAATTCAA
CTACAGGGGC AAAATATCAA CTTAACTGAA GGTTCTACTG TAGTAATACA AAACTTGGGG
ACACAATCGC AAGGAATTAC TGTCCACGCC ACAGGTTCTT TGAATTTGAC AGGCTATACT
CCCGACCAAA AACAGGGCAG TATAATCGCA ATCGAAAACT TGGGAACAAG TTCATCAGGA
GATATTGTAG TTTCCGCCAA TCAACTTTTC GTACAAGATG GTGGACAGAT TCAGACTTTC
ACTCCTACCG CAGCAGCCAG CGGGAATATT TCAATTGATG TTGAAGACTT GATTTACCTG
AATGGTTTTA TCCCTACTAA CCCGACTGTG AACACCAAGA TCATAACAAT CACGGATGGC
TCTGGTAAAG CTGGTGATAT TACCATTTCA TCGGGCAGCT TAAAAGTTTT CAATGGGGCT
AGTCTCATTT CTGTGACAAT GGGTTATGGG GAGGGCGGAG CGATGCAGAT CAATGCCAAA
GACCTCATCG AGATTGTCGG TAACAATCCC ATTATCTTAG TACCTAGTGC AATCTCTTCG
GCAACGATTG GTACCGGCAA TGCAAATAGC ATATTGGTTA ACACATCCAG ATTAATCCTT
AGAGATGGGG GAGTTCTGGG TTCTAACACT CTGAGTCAGG GTAGAGCGGG AAATGTAACA
GTTAATGCTT CAGATTTCCT AGAGGTTAGT GGTAAAGCAC CTGGATCAAT TAGATCGAGT
AACATTATGT CCTCATCTGA GATTCTCGAT CAAGTTGTTC AAGAGACTTA TGGACTACCG
TCAATTCCCA CTGGTGATGC CGGTTTTCTG ACGATTAATA CCCCATCATT ACGCATTAGT
GATAGTGCAT TCGTGAGTGT TAAGAATGAT GGACCTGGCA GAGCTGGAGA TTTACAAATT
AACGCTAATT TGCTTTTTCT ATACAAAGAA GGTAGTATCA GTGCATCTAC TGCTTCAGGA
AATGGAGGTG ATATTCAGTT AAACTTACAA GATTATCTGT TGATGTATCA AGATAGTGTT
ATTTCCGCTA CTGCTCAGGG TAATGGAAAT GGCGGTAATT TGTCAATTAA CTCACCAGTA
ATTGTTGGTT TAGAAAACAG TGACATCATC GCCAATGCAG TTCAAGGTCG TGGTGGCAAT
ATTAATATCA GCACTCAAAA CATAATCGGT CTAGAGTTTC GCGATATCCT CACTCCCCGC
ACAGTCCCAA CAAATGATAT TACTGCTAGT TCCCAGTTTA ATGTTAATGG CACAGTGCAA
ATTAATAACA TCAGTATTGT CCCCAGTTCT GGTTTAGTCG AACTACCTGC AAATATTACT
GACCCATCAC AGCAAATAGC TATAGGATGT GCAGATACTA GTGGCAGTAG TTTTGTCGCG
ACAGGACGAG GTGGAATACC CCAAAATCCC ACTCAGGAAG TGAGGAGCGA TAAACCTTGG
TCTGATGTTC GCGATCTCTC TTCATATCGC ACAACAGCAC AAGTGCAAGC ACAAGTACTT
CAATCCCGAG CGAATTTTAT ACAAGCTACT TCCTGGCATC GTAATTCCCA AGGGAAAATT
GAGTTAGTTG TAGATAAATC TTCTATGAGT ATGCAACCGT CATTAACCTG TGTTGCTGTT
CCTAAAAGTT AA
 
Protein sequence
MKVTFSIFAL ISVVLTSVTY NSRVQAKVTP DSKLKTTVTG SNNYTITNGN RVGNNLFHSF 
SEFSIPTNGS AFFDNANDIQ NIFSRVTGGN LSNIDGLIQA NGSANLFLLN PSGIIFGANA
RLDIGGSFVG TTGNSIKFAD GTEFSAVNAS SSPLLTMSVP IGLQMGQDSG AITVQGLGHR
IIPPFSVAQE LDLSNNPTGL QVKAGNTLAL IGSGLNFAGG IVAADGGGHI EIGSINHGLV
RLNSTVTGWK GDYSQVEQFN DIHLAQQSLL DASGSNGSIQ LQGQNINLTE GSTVVIQNLG
TQSQGITVHA TGSLNLTGYT PDQKQGSIIA IENLGTSSSG DIVVSANQLF VQDGGQIQTF
TPTAAASGNI SIDVEDLIYL NGFIPTNPTV NTKIITITDG SGKAGDITIS SGSLKVFNGA
SLISVTMGYG EGGAMQINAK DLIEIVGNNP IILVPSAISS ATIGTGNANS ILVNTSRLIL
RDGGVLGSNT LSQGRAGNVT VNASDFLEVS GKAPGSIRSS NIMSSSEILD QVVQETYGLP
SIPTGDAGFL TINTPSLRIS DSAFVSVKND GPGRAGDLQI NANLLFLYKE GSISASTASG
NGGDIQLNLQ DYLLMYQDSV ISATAQGNGN GGNLSINSPV IVGLENSDII ANAVQGRGGN
INISTQNIIG LEFRDILTPR TVPTNDITAS SQFNVNGTVQ INNISIVPSS GLVELPANIT
DPSQQIAIGC ADTSGSSFVA TGRGGIPQNP TQEVRSDKPW SDVRDLSSYR TTAQVQAQVL
QSRANFIQAT SWHRNSQGKI ELVVDKSSMS MQPSLTCVAV PKS