Gene Aazo_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2048 
Symbol 
ID9339840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2126491 
End bp2128359 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content41% 
IMG OID 
Productchaperone protein DnaK 
Protein accessionYP_003721226 
Protein GI298491049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00388714 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAG TAGTAGGAAT TGATTTAGGA ACTACAAACT CTTGTGTTGC CGTGATGGAA 
GGAGGGCAAC CTTTAGTAAT TGCTAATTCT GAGGGGCAAC GTATTACTCC TTCGGTTGTA
GCTTACACAA AAACTGGTGA ACGCTTAGTT GGTCAAATTG CTAGACGACA AGCGGTAATG
AACCCAGAAA ATACTTTTTA TTCTGTGAAA CGTTTCATTG GGCGCAAATA TGAAGAAATT
ACCCATGAAG CCACGGAAGT TTCTTATAGA ACTCTGCGGG ATAGTAATGC CAATGTAAAG
CTCAATTGTC CCGCTGCAGG TCAAGAATTT GCACCAGAAG AAATTTCTGC CCAAGTGCTA
AGAAAGTTAG TTGATGATGC TAGTAAATAT TTAGGAGAAA AAGTTACTCA AGCGGTGATT
ACAGTTCCTG CTTATTTTAA CGATTCTCAA CGGCAAGCTA CTAAAGATGC AGGTAAAATT
GCTGGGTTAG ATGTTCTCCG CATTATCAAT GAACCAACAG CAGCAGCTTT GGCTTATGGT
TTGGATAAAA AGGAAAATGA AACTATCTTA GTATTTGACC TTGGTGGCGG TACTTTTGAT
GTTTCTATTT TGGAGGTTGG GGATGGTGTA TTTGAAGTTA AATCTACTAG TGGGGATACT
CATTTAGGTG GTGATGACTT CGATAAAAAG ATTGTTGATT GGTTAGCAAA TCAGTTTCAG
AGTAACGAAG GCATTGACTT ACGTAAGGAT AAACAAGCTT TGCAAAGATT GACTGAAGCT
GCGGAGAAGG CAAAAATTGA GCTTTCTAGT GCAACTCAAA CTAATATCAA TTTGCCCTTT
ATTACTGCTA CTCAGGCGGG TCCAAAACAC TTGGATATGA TGCTGACACG GGGTAAATTT
GAGGAGATGA CAGCCGACCT TCTCGACCGT TGTCGTAAAC CAGTCCAACA AGCATTGCAA
GATGCAAAAC TCAGTAATGC TCAACTTGAT GAAATTGTTT TAGTTGGTGG TTCGACTCGC
ATTCCCGCTG TGCAAGAACT GGTGCGACGG ATGACGGGTA AAGAACCTTG TCAAGGTGTA
AATCCTGATG AAGTTGTAGC GGTGGGTGCG GCTATTCAAG CGGGTGTTTT ATCCGGTGAA
GTCAAAGATA TTTTACTGCT TGATGTTACG CCGTTGTCTT TGGGTGTGGA AACTATTGGC
GGTGTGATGA CTAAGATTAT TAGCCGCAAT ACAACTATCC CGGTGAAGAA ATCAGAAGTC
TTTTCTACGG CTGCTGATGG TCAAAGTAAT GTGGAAGTTC ACGTTTTGCA AGGTGAGAGG
GAACTAGCTA AAGATAATAA GAGTTTAGGT ACTTTCCGTT TGGATGGTAT TCCTCCAGCA
CCGAGAGGTG TACCGCAAAT TGACGTTACT TTTGACATTG ATGCTAACGG TATTCTTTCT
GTTACTGCTA AGGATAAAGC CACGGGCAAA CAGCAGTCCA TTTCTATTAC AGGTGCTTCT
ACTCTCGATA AGCGGGATGT AGAAAAGATG GTGCGGGATG CAGAATCTCA TGCGGAGGAA
GATAGAAGAC GACGTGAACA AATTGATACT AAAAATTTGG GTGATTCTTT AGTTTATCAA
GCTGAGAAAC AACTCAGAGA CTTGGGTGAT AAGGTGAGTG CTGTTGATAG AGGACGAGTT
GAAGATTTGG TCAAGGATTT GGCGGAAGCT ATCAATCAAG ATCATTTCGA TCGGATTAAG
TCTCTGAGCA GTCAATTACA GCAAGTGTTG ATGCAGGTTG GTAGTACGGT TTATGCACAA
GCAGGAAGTT CTGATGGAAG TAGTAGGAGT GAAGATGTGA TTGATGCTGA CTTTGTAGAA
AATAAATAA
 
Protein sequence
MAKVVGIDLG TTNSCVAVME GGQPLVIANS EGQRITPSVV AYTKTGERLV GQIARRQAVM 
NPENTFYSVK RFIGRKYEEI THEATEVSYR TLRDSNANVK LNCPAAGQEF APEEISAQVL
RKLVDDASKY LGEKVTQAVI TVPAYFNDSQ RQATKDAGKI AGLDVLRIIN EPTAAALAYG
LDKKENETIL VFDLGGGTFD VSILEVGDGV FEVKSTSGDT HLGGDDFDKK IVDWLANQFQ
SNEGIDLRKD KQALQRLTEA AEKAKIELSS ATQTNINLPF ITATQAGPKH LDMMLTRGKF
EEMTADLLDR CRKPVQQALQ DAKLSNAQLD EIVLVGGSTR IPAVQELVRR MTGKEPCQGV
NPDEVVAVGA AIQAGVLSGE VKDILLLDVT PLSLGVETIG GVMTKIISRN TTIPVKKSEV
FSTAADGQSN VEVHVLQGER ELAKDNKSLG TFRLDGIPPA PRGVPQIDVT FDIDANGILS
VTAKDKATGK QQSISITGAS TLDKRDVEKM VRDAESHAEE DRRRREQIDT KNLGDSLVYQ
AEKQLRDLGD KVSAVDRGRV EDLVKDLAEA INQDHFDRIK SLSSQLQQVL MQVGSTVYAQ
AGSSDGSSRS EDVIDADFVE NK