Gene Aazo_1153 details


Gene Information

Locus tag: Aazo_1153
Symbol:
ID: 9338948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1236866
End bp: 1238776
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003720602
Protein GI: 298490425
COG category:
COG ID:
TIGRFAM ID:
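
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1236866
END_BP = 1238776
REPORTED_GENE_LENGTH = 1911      # bp
REPORTED_PROTEIN_LENGTH = 636    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a complete, unspliced CDS.

    Assumes the last codon is a stop codon, which is not translated.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, length == REPORTED_GENE_LENGTH)
print(protein_length(length), protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 1238776 - 1236866 + 1 gives 1911 bp, and 1911 / 3 - 1 gives 636 residues, which agrees with the reported values.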


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAAG TCACATTGAA AAAAGAAGAT ATGCAGCTTA TAGCCAGTAT TTTAGAAGAA 
CAACTACAGA TGGAACTTCC ACCTGACAAA GTTTTTCAGG TTCGATGTGC CATCCAAAAT
GACGAACTGA TGGTTTTAAT TCAACACCCT GTAAATGTAT CTCTTGAGAC GCAAAAAATA
TTTGCGGTAC TTGAGGAAAC ACTCCAGTCG CAATCTCAAT ATCATGAGCA AAGGGTGAAA
TTTTACGTGA GAGTTTCTGG AGAAAAAGTA CCTTACACTA AGAATTCCAT AATTGTTAAG
CCTAAAAAGA TACAAGAGGA TTCAAGAATT ATCGGAGAAG AAGCCAGCAC AGGCTCAGAT
TACAGTACAG CAAATCACCA ATTGATTTTT CCTCCACTCA ATATTCTCAA CCCTCCCTCT
CCATCTCCTA CCACAGAAGA CAATATCTCG GATAATCCAT TTAACTTACT AGTAGACGAC
CATATGTCGG ATAATCCATT TAACTTACTA GTAGATGACA ATATCTCAGA TAGTTCATAT
AGCTCAGATA CTTTATTCGG CTCAGATCAT TCATTGCTTG AGGAGAGTAA GGATAGCCCA
GGAGAAGCAG AAAAGTTTGA CCCCTTCGGA GATGGAAAAA ATTTATCGAA AACCAAACAC
CTATCTGTTT CATCCTTACC CATCCTCTTA GGTGGGGTAT TGGGAGTGGC AGTTATTTTT
GGTGGTGGTA ATTTTTTTCT GATTCGTGCT TGTGTGATTG GTGAGTGTAA AGAATTACAA
ATAGCCCAGC AATTCAAAAG CAATACCCAA GAATTGATCC GTCAGGCTAA GTCTCAAAAA
GAACTAGTAG CAGTACAACA ACAAATAGAT ACAGTTATTT CTGATCTCAA AGTAATTCCT
CAATGGTCGC CTCGTTATCA ATCAGCCCAG GAAATCAATT TAAGTTTTTC TGACCAATCA
GCAAAAATTA CCCAGGTACT AAAAGCTTTA GAGTCTGCAA ATATAGCAGA GAAAAAAACC
AAAACCCCTC CAACTAGCCT GGAAGAGCTA CGTGCTAGAC AAAGTTTATG GCGACAAGCA
ATCATACCAC TAGAGTCTAT TAAACCTGGT AACGAACTAT ATGGACTAGT GCGGGGAAAT
TTGTCCAAGT ATCAAAGCAA TTTACAGACT CTCAATCAGC AATTGCTCAG TGAAGAATCA
TGGCTGAAAA AACTCACCAC AGCCAAAACT GTAGCTGAAT CAGCCCTCAA GCGTGAAGCT
AATGCTAAAT CGGATAATGA CTGGCAAAGG GTACAGTTTG CTTGGCAAGA AGTTGTTAAT
GATTTGAAAA GTATTCCCTC AAACAGCACA ACACACCAGG AAGCAAAAAA CCTTTTAACG
GATTATCAGC CTAAACTTAG GTTAGCACGC AACCGCGCTA AGAAACAAGC TGCTGCGGCT
CTCAACTTCA AACAAGCCGT CAACATGGCT AATCAAGCTA AAGTTTATGA AACACAAAAT
AAATGGCAGG CAGCAATAGC ATCTTGGGAG CAAGCTGTAC AGATAGCTAA ACAGGTTTCT
CAAGATAGTT CTTTCTACAG TCAAGCACAA TCCCTCATTC AACCCTATTC CACTGCTCAA
GCACAGGCAA AAGAAAAACA ACAACTCGAT GGTAATTTGG CACAAACTCG CGCTGATTTG
GGAAAAACCT GTGTTAATAA GATGCGGTTT TGCATTTTTA ACATCGAAAC TAGGGGTATT
GTTGTCCGTT TAACCCCAGA GTATGACCAA GCATTACAAA GTAACCCTGG TGTCCAAAGT
CATTTACAAG CCTTGCAAGA GGCTTTAGGA GTCATCAGTG AAAATTCTAA CCTACCCGTA
TTTCTCTATA ATTCCCAAGG ACAGGAAAGG TATATGAAGA TGCCGCAATA G
 
Protein sequence
MIKVTLKKED MQLIASILEE QLQMELPPDK VFQVRCAIQN DELMVLIQHP VNVSLETQKI 
FAVLEETLQS QSQYHEQRVK FYVRVSGEKV PYTKNSIIVK PKKIQEDSRI IGEEASTGSD
YSTANHQLIF PPLNILNPPS PSPTTEDNIS DNPFNLLVDD HMSDNPFNLL VDDNISDSSY
SSDTLFGSDH SLLEESKDSP GEAEKFDPFG DGKNLSKTKH LSVSSLPILL GGVLGVAVIF
GGGNFFLIRA CVIGECKELQ IAQQFKSNTQ ELIRQAKSQK ELVAVQQQID TVISDLKVIP
QWSPRYQSAQ EINLSFSDQS AKITQVLKAL ESANIAEKKT KTPPTSLEEL RARQSLWRQA
IIPLESIKPG NELYGLVRGN LSKYQSNLQT LNQQLLSEES WLKKLTTAKT VAESALKREA
NAKSDNDWQR VQFAWQEVVN DLKSIPSNST THQEAKNLLT DYQPKLRLAR NRAKKQAAAA
LNFKQAVNMA NQAKVYETQN KWQAAIASWE QAVQIAKQVS QDSSFYSQAQ SLIQPYSTAQ
AQAKEKQQLD GNLAQTRADL GKTCVNKMRF CIFNIETRGI VVRLTPEYDQ ALQSNPGVQS
HLQALQEALG VISENSNLPV FLYNSQGQER YMKMPQ
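
The gene and protein sequences above are printed in blocks of 10 characters. As a rough sketch of how they could be processed, the snippet below strips the block spacing, computes GC content, and translates a nucleotide string. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code (the two tables differ only in permitted start codons). The helper names are hypothetical, and only the first and last lines of the gene sequence are pasted in to keep the example short.

```python
# Codon table built from the conventional TCAG ordering.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGATTAAAG TCACATTGAA AAAAGAAGAT ATGCAGCTTA TAGCCAGTAT TTTAGAAGAA"
)
last_line = clean_sequence(
    "TTTCTCTATA ATTCCCAAGG ACAGGAAAGG TATATGAAGA TGCCGCAATA G"
)

print(translate(first_line))
print(translate(last_line))
print(round(gc_content(first_line), 2))
```

Because every full line holds 60 bases, each line starts in frame, so the two translated fragments can be compared directly with the start and end of the protein sequence listed above.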