Gene Aazo_4567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4567 
Symbol 
ID9342372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4655860 
End bp4657449 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content43% 
IMG OID 
Productcarbohydrate-selective porin OprB 
Protein accessionYP_003722947 
Protein GI298492770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.316045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAT TCTGGGCATA TCTATTAGCA AGTCCGGCAA TTTGCGGTGC GATGCTTTCT 
ATGGGGACGG GTCCATCAGC CGGGGAAGTA ACAACGACAA CGGAAACTAG TGAACCAATT
GGAATTACTA ATCTAACGCC AAAACTTGGA AAAATTAATC AACAGGAGTT GATATCACAA
GTAACATCAG TATCTGAGTT ATCAGACGTG CAATCCACAG ATTGGGCATT TCAAGCTTTG
CGATCTTTGG TAGAACGTTA CGGTTGTATT GCTGGCTATG CCAACGGTAC TTATCGTGGT
AATCGTGCTA TGACAAGGTA TGAATTTGCA GCCGGGTTGA ATGCTTGTTT AGATCGAGTT
AACGAATTAA TTGCCACTGC TACTGCTGAT CTAGTAACAA AACAGGATTT AGCCACACTG
CAAAAACTAC GAGAAGAATT TGCGGCTGAA CTAGCAACCT TGCGGGGTCG TGTAGATGCG
CTGGAAGCCA GGGCTGCTGA GTTAGAAGCC AATCAATTTT CTACCACCAG CAAATTGCAA
GGGCAACTAG TCGCCGTCTT CAGTGAGGTT TTTGCTGGTA ACAGTGGTGA CAATAAAAAT
AGCACTCTAG GCGCACGGGC GCGGATTGAA TTTGTCAGCA GCTTTAGTGG TCAAGATACG
CTGTTTACCA GAATTGAGAG TAATAATATC AATAGCCCTA TCAGCAGTCC ACAACAGGGT
AATTTGTTTT TTGCTGGTAG TGGTACTAAC GATACTTTCT TAGGTACACT GTGGTACAAA
TTCCCAGTAG GTAACAAAAC ACAGGCAATA GCTATTGCTA CTGGCGGTGC AGCAGATGAC
CTTACCAGCA CAATTAATAT TTTTGATGGT GATGGTGATG GTGCTTTGTC CACCTTTGGT
ACACGCAACC CAATCTATAA CCAGATCAGT GGTGCAGGTT TGGGAGTAAA TCACCAGTTC
AACAAGAATA TAACCTTAAG TTTAGGGTAT TTAGCAGGTA CTACCGACAA TCCTGCCTCA
AACCCTGCCT CTAAAAATGG TTTGTTTGAT GGACCCTATG GTGCAATGGC ACAGTTGACC
CTCAAACCAT GTGATCGCAT TGCCCTTGGT TTAATCTATA TCAATTCCTA CAATCAACCA
ATACTCACAG GTAGCGAAGC TGCAACATCT GATATTAGCA GTGAAGCATT TTCCAGTAAT
TCTTACGGTC TCCAAGCATC CGTTGCCATC AGTGAGAAAT TTGTATTGGG TGGTTGGGCT
GGATATACCC GGAGTCAAGT GTTAACAAGA GAGAAAGGAG ATGTAGACAT TTGGAACTAT
GCCGTTACCC TTGGTTTTCC AGACTTGGGT AAAAAAGGTA ACTTAGCTGG TATGATCCTG
GGCATGGAAC CGAAAGTTAC GAGTTCTAGC ACCTCAGTGG TGTCTGAAGA CTTAGATACC
TCATATCACA TTGAGGCATT TTATCAATAC AAAATTAGTG ACAATATCAC AATTACCCCT
GGTGTTATTT GGATAACAGC ACCAGACCAT AATGATACTA ATAATAATGA TGTGGTAATT
GGTGCTTTGA GAACCACCTT CAGTTTCTAA
 
Protein sequence
MQKFWAYLLA SPAICGAMLS MGTGPSAGEV TTTTETSEPI GITNLTPKLG KINQQELISQ 
VTSVSELSDV QSTDWAFQAL RSLVERYGCI AGYANGTYRG NRAMTRYEFA AGLNACLDRV
NELIATATAD LVTKQDLATL QKLREEFAAE LATLRGRVDA LEARAAELEA NQFSTTSKLQ
GQLVAVFSEV FAGNSGDNKN STLGARARIE FVSSFSGQDT LFTRIESNNI NSPISSPQQG
NLFFAGSGTN DTFLGTLWYK FPVGNKTQAI AIATGGAADD LTSTINIFDG DGDGALSTFG
TRNPIYNQIS GAGLGVNHQF NKNITLSLGY LAGTTDNPAS NPASKNGLFD GPYGAMAQLT
LKPCDRIALG LIYINSYNQP ILTGSEAATS DISSEAFSSN SYGLQASVAI SEKFVLGGWA
GYTRSQVLTR EKGDVDIWNY AVTLGFPDLG KKGNLAGMIL GMEPKVTSSS TSVVSEDLDT
SYHIEAFYQY KISDNITITP GVIWITAPDH NDTNNNDVVI GALRTTFSF