Gene Aazo_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1067 
Symbol 
ID9338863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1140613 
End bp1142103 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content41% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003720547 
Protein GI298490370 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.363484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACCG GGTACATCCT GATAGCAGCT ATTTTGATTT TGGGAGGTGT GATTGCTACA 
GTGGGCGATC GTATCGGCAC ACGAGTTGGC AAAGCACGCC TCTCACTATT TAAGCTGCGT
CCCAAAAATA CGGCAGTGCT GGTAACTATT TTTACTGGTG GTCTAATTTC TGCCTCAACC
CTAGCAATTT TATTTGCTGC TGATGAAGGA TTGCGGAAGG GCGTCTTTGA GTTAGAGGAT
ATTCAAAAAG ACCTGAGAAA CAAGCGAGAA CAACTTAAAA CCGCAGAAAC GGAAAAAAGC
CAAGTAGAAA AACAGCTGAC CGAAGCTAGA AAAGAACAAA CTCAGGCACA ACAAGATTTA
CAAAACATTA ATCAGTCTTT ACAAGCTGCC AATGCTAAAC AACGCATAAC ACAAGCTCAA
CTCAACCGCA CCATTAGTCA ACAAGCTAAA ACCCAAGCTC AACTGCAAGG TACTCAAAGC
CGACTTGGTG AAGTAGTGAT ACAGTATAAA CAAGCTAGAA CTGAACTACA AACCCTTTAT
AATCAACGTC AGGCATTGCA AACAGCAGTT GAAGAATTAA AGACAGAACG GCAGCGACTA
TATGCAGAAG CGAAAAAAGC GATTGACGAA GCAAAAACAG CTATTGAAAA ACGCGATCAG
GAACTTGCTA ATCGCCAAAA AGCTATTGAA AAACGTGATC AGAAAATTGC TAAATTAGAT
CAACTAATTC AAAATCGTAA TCTAGAGATT AAAAAACGGG AGCAAGTAAT TGCTACTAGG
GAATCTCGTC TCAAAGAATT GGAACAACAG CAAGATTATT TAGAACAAGA AGTAGCAAGG
CTGGAAAAAT ATTACCAGTC ATATCGTGAC CTGCGTTTAG GTAAATTGGC TTTAGTTCGT
GGACAAGTTT TAGCTGCTGG TGTGGTACAA GTTAATCAAC CTACTGCGGC TCGTCAGGTA
CTAGTCCAAA TTTTGCAGGA AGCTAACCGC AATGCCAACA TTGAATTAAG CGAACCTGGT
TCTAATCCTG GGAATGCAGA ACTATTGCGT GTTACTCAAG ATAGGGTTGA GCAACTAATC
AATCAAATCG ACGATGGAAG AGAATATGTA GTGCGAATCT TCTCTGCGGG TAATTACGTT
AGGGGAGAAA AGCAGATAGA ATTTTTTGCT GATGCTACGC GCAATCAATT GGTATTTTCC
ACAGGTCAAA TCCTGGCTAC AACTGCGGCT GATATGAAAA ACATGACATC ATATCAATTA
AGGCAACGGC TGGACTTGCT GATTTCTGCT TCCCAATTTC GCGCTCGCAA TGCAGGAATT
ATCGAAACTG TACAAGTAGA GGGTACTTTT CTGCGCTTTT TCGCCCAATT GCAACAGTCT
AATCAACCAT TAGAAATTAA AGCTATAGCT GCGGAGGATA CCTATACCGC TGGACCTTTA
AGAGTGAAAT TAGTGGCAAT TTTCAATGGT CAGGTTATTT TCAGCACTTA A
 
Protein sequence
MATGYILIAA ILILGGVIAT VGDRIGTRVG KARLSLFKLR PKNTAVLVTI FTGGLISAST 
LAILFAADEG LRKGVFELED IQKDLRNKRE QLKTAETEKS QVEKQLTEAR KEQTQAQQDL
QNINQSLQAA NAKQRITQAQ LNRTISQQAK TQAQLQGTQS RLGEVVIQYK QARTELQTLY
NQRQALQTAV EELKTERQRL YAEAKKAIDE AKTAIEKRDQ ELANRQKAIE KRDQKIAKLD
QLIQNRNLEI KKREQVIATR ESRLKELEQQ QDYLEQEVAR LEKYYQSYRD LRLGKLALVR
GQVLAAGVVQ VNQPTAARQV LVQILQEANR NANIELSEPG SNPGNAELLR VTQDRVEQLI
NQIDDGREYV VRIFSAGNYV RGEKQIEFFA DATRNQLVFS TGQILATTAA DMKNMTSYQL
RQRLDLLISA SQFRARNAGI IETVQVEGTF LRFFAQLQQS NQPLEIKAIA AEDTYTAGPL
RVKLVAIFNG QVIFST