Gene Aazo_3223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3223 
Symbol 
ID9341027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3315395 
End bp3316810 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content41% 
IMG OID 
Productcell envelope-like transcriptional attenuator 
Protein accessionYP_003722054 
Protein GI298491877 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTAGTC AAAGAACATC AGCAGAAAAT AATAAATCAG CAAAAGCGAA AACCAGAAGT 
AAAACCTCTC GTAAATCAAA ATCAGGGCGT TGGCTATGGT TTGTGGTGGG TATGGGTGGG
ATTGCAATGG TTTCAGGTAT GGCGGGAGCT TTGTTGGCAG TTTCTTGGGA CAGTACACCT
TTGCAGCAAG AGCAGTTGAG TGCCAAGGAT GCAGCAGTTT TTGACGGTGA TCGCATTTCG
GGAAATGGAT TGCAATTTTC CCAATTAACT CGCCCAGTGA ATATCTTAGT TATGGGCATG
AGTGTACTGC CACCAGATGT TCAAAACCAA CCCAGTGATA CCAAAGAACT TAAATATCTA
CCCCAGATCA ATTCTTTTGA TGGTCTCTCT GACGTGATGC TCTTAATCAA ATTTGATCCA
GAGACAAAAA AAATTGTCAT GCTTTCTATT CCTAGAGATA CTCGTGCAGA AATAGAAGGG
TTTGGTGCCA AAAAAATTAA CGCCGCCAAT GTCGATGGTG GACCAGCTTT AACTGCTAAA
GCCGTCAGTA ATCTCTTGGG TCGAGTGGGA ATTGACCGTT ATGTCCGCAT TAATGTCCTG
GGGGTTGCCA AGCTGATAGA TGTTTTGGGT GGGGTAACAG TTTACGTTCC CAAAGATATG
AAATATCAGG ATGATTCACA GCATTTATAT ATTAATTTAA AGGCAGGTAA ACAGCATCTT
AAAGGTGAAC AAGCCTTACA GTTGCTGCGT TTTCGCCATG ATGAACTAGG TGATATTGGA
AGAATTCAGC GTCAGCAAAT GGTCTTGCGT TCTTTGATTG AACAAACTCT CAATCCCTCA
ACATTAACGC AATTGCCCCA AATTTTGAAT GTAGTTAAAG ATAATATCGA CACTAATTTA
ACAGTTGAAG AATTAGTTGC GTTAGTTGGT TTTGGTTCAC GAACTAATCG TTCTAATATG
CAGATGTTGA TGTTGCCTGG ACGCTTTAGT GGAAAGAGTG AGTATGATGC CAGTTATTGG
ATACCCCAGA AACGAGCAAT CAACAAATTA ATGGTTCAGA ATTTTGGTTT AGAATCAGAA
TTATTAGACA CTGAAACAAT AGACCTTGGT GCATTGCGAG TAGCGATTCA AGATAGCACA
GGTGGCGATC ACTCTCAAAT CCGTCCCCTA ATTATAGCCT TGGAAAAAGC CGGATATCGC
AACATCTTTA TCTCTAAACC ATGGGGTGAA CCTCTGGAAA TTACCCATAT CGTCGCCCAA
CAAGGAGACA GTGAAAGCGC CGAATCAATT CGTAATACTT TAGGATTTGG CGAAGTGCGA
GTAGAAAGCA CAGGTAATAT CGGTTCAGAT ATCAGCATCC AAGTCGGTAA AGATTGGTTA
GAAAAGAAGG CAACTTTTGA AGCCTATAGT AGGTAA
 
Protein sequence
MTSQRTSAEN NKSAKAKTRS KTSRKSKSGR WLWFVVGMGG IAMVSGMAGA LLAVSWDSTP 
LQQEQLSAKD AAVFDGDRIS GNGLQFSQLT RPVNILVMGM SVLPPDVQNQ PSDTKELKYL
PQINSFDGLS DVMLLIKFDP ETKKIVMLSI PRDTRAEIEG FGAKKINAAN VDGGPALTAK
AVSNLLGRVG IDRYVRINVL GVAKLIDVLG GVTVYVPKDM KYQDDSQHLY INLKAGKQHL
KGEQALQLLR FRHDELGDIG RIQRQQMVLR SLIEQTLNPS TLTQLPQILN VVKDNIDTNL
TVEELVALVG FGSRTNRSNM QMLMLPGRFS GKSEYDASYW IPQKRAINKL MVQNFGLESE
LLDTETIDLG ALRVAIQDST GGDHSQIRPL IIALEKAGYR NIFISKPWGE PLEITHIVAQ
QGDSESAESI RNTLGFGEVR VESTGNIGSD ISIQVGKDWL EKKATFEAYS R