Gene Aazo_3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3920 
Symbol 
ID9341724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3982670 
End bp3983848 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content34% 
IMG OID 
ProductDevC protein 
Protein accessionYP_003722546 
Protein GI298492369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGTAA AAATCCCTTT AGCATGGCTA CAGCTTGCCC AGCAAAAAGT ACGTTTTCTT 
GTAGCTGTAG CCGGAATAGC CTTTATTGTA CTGCTGATGT TTATTCAACT TGGGTTCCAA
GATGCACTTT ATTCTAGTGC TACAGCATTA CATCAAAATC TCAAGGGTGA TTTGTTTTTA
GTCAGTTCTC AATATAAAGC TTTGACTGCT AATCAAAGTT TTTCTCGAAA TCGTTTATAC
CAAACATTAG GTTTTAATGG TGTCGAATCA GTTAGCCCTA TATATTTGCA ATTTGCCAAA
TTAAAAAATC CTGCTACTGG CGAAAAATAT TCAATCTATG TCATAGGTTT TGACCCAGGA
AAACCAGTGA TGAATCTACC AGAAGTCCAG AATAATTTGG ATATACTTAA AAATACTGAT
GTCATGTTAT TTGACAAGAA TTCTCGCCCA GAATTCGGTC CAATAGCAGA AAAGTTTGAG
CAAGGAGATA CTGAACAAAC AATTGAAATC TTTCCCTTTG ATTCTCTTCA AGGTTATCGA
GTCAGAGTCG GTGGTTTATT CGGTTTAGGA CCGTCCTTTG GTGTCGATGG AAATTTAATT
GTTAGCGACT CAACTTTCTT AAAGATTAAT CCTAATACCC GTCATGCAGA AAACATAGAT
GTAGGTATTA TTAAAGTCAA ACCAGGTTTT GACCCAAATG AGGTTCTAAA AGATTTGCAA
GCAAGTCTAC CTAATGATGT ACAGATATTT ACTCGTAAAG GCTTTATTAA TTTCGAAAAA
GAATATTGGG CAGCTAGAAC ACCCATAGGT TTCATACTTA ATCTCATGCT AACTATGGCC
TCTGTGGTGG GTGTAGTTAT TGTTTATCAA ATTCTTTACA GCAATATTGC TACTCAATTT
ATTGCCTACG CAACATTAAA AGCTATTGGC TACCCTAATG CTTATTTATT AAATGTAGTT
TTTCAACAGG CATTAATCTT AGCTTTATTA GCTTATATAC CAGGATTTAT TTTCTCCGTT
ACCTTATATG ATTTTGCGAT GGAAGTAACT AAATTACCAA TCATTATGAC TTCTAATAAT
GCCTTAATTG TTTTAACTTC TACAGTTCTA ATTTGTATAA CTTCTGGAGC ATTAGCTATT
AATAAACTTC GCTCTGCAGA TCCGGCTGAT ATTTTCTAA
 
Protein sequence
MIVKIPLAWL QLAQQKVRFL VAVAGIAFIV LLMFIQLGFQ DALYSSATAL HQNLKGDLFL 
VSSQYKALTA NQSFSRNRLY QTLGFNGVES VSPIYLQFAK LKNPATGEKY SIYVIGFDPG
KPVMNLPEVQ NNLDILKNTD VMLFDKNSRP EFGPIAEKFE QGDTEQTIEI FPFDSLQGYR
VRVGGLFGLG PSFGVDGNLI VSDSTFLKIN PNTRHAENID VGIIKVKPGF DPNEVLKDLQ
ASLPNDVQIF TRKGFINFEK EYWAARTPIG FILNLMLTMA SVVGVVIVYQ ILYSNIATQF
IAYATLKAIG YPNAYLLNVV FQQALILALL AYIPGFIFSV TLYDFAMEVT KLPIIMTSNN
ALIVLTSTVL ICITSGALAI NKLRSADPAD IF