Gene Aazo_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4104 
Symbol 
ID9341909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4173540 
End bp4175024 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content37% 
IMG OID 
ProductNADH:ubiquinone oxidoreductase complex I intermediate-associated protein 30 
Protein accessionYP_003722674 
Protein GI298492497 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.185236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAG AGAATCGTTC TCCATGGGAC TTATACCGAT TCATCAAAAC CCTGACCTAC 
TTTGAGGTAT TTCCTGTTCT GAACTGGCTA CAAAAGTTAT TTCAAGGTCG TCCTGCAAAT
CATCAGAATC CACCAAATCG GGGGAGGAAA ATGGGTATAA TTCTAGTAGC TGGTGCGACA
GGTGGAGTTG GTAAACGAGT AGTAAAAAGA TTACTAACCC AAGGTTATAA AGTTCGCTGC
TTAGTCAGAG ATATTGACAA AGGGCGGTCA ATTATTGGTA ATGAAGTTGA CTTAGTAGTG
GGAGATATTA CCAAACCTGA AACCCTCAAC AGCTTAGTTA TGAGTAATAT CCAAGCAGTT
GTTTGTTGTA CAGCAGTGCG TGTTCAACCA GTAGAAGGAG ACACACCTGA TAGAGCTAAA
TATAATCAAG GTGTGAAATT TTATCTCCCA GAAACTGTTG GTGATACACC CGAAAATGTC
GAATATAACG GAGTAAAGAA CCTAGTAGAA GCAGCAGTAA AATATCTACC AAACACAGGA
GAAAAAGGAA TATTTGACTT TACTCAATCA TCACAAGAAT TAAAAGACAT TTGGGGTGCT
TTAGATGATG TGGTTATGGG TGGTGTCAGT GCCAGCAATT TTCAGATCTT AGAAAAAACT
GCTTTATTTG CTGGTAATGT TTCTACTGCT AATTCAGGAG GTTTTGCTTC TGTCAGAACC
AAGAGTTTCT CACCAGCAAT TGATTTATCA GGTTATGCAG GTGTAAAATT GCGCGTCAAA
GGTGACGGAC AACGTTATAA AATCTTTTTG CGAACAGAAT CAATATGGGA CGGTGTTGGT
TACAGTTATT CTTTCGATAC TGTAGCTAAT ACATGGATAG ATATTACCAT TCCCTTTGCG
AATTTAACAC CTGTATTTAG AGCTAAATCT GTTAAAAATT GTCCACAAAT TGATGCTAGT
AAAATTTGCT CTTTTCAATT GATGTTGAGT AAATTTGAAT ATGATGGAGC TTTAAATCCT
AAGTTTAATA CTGGTAGGTT TACATTAGAA CTAGAGTCGA TCAAAGCTTA TGGTGGTGAA
ACTTTACCCC AATTTGTTTT AGTTAGTTCT GCTGGTGTGA CTCGTCCCGG AAGACCAGGA
ATTAATTTAG ACGAAGAACC TCCAACGGTA AGATTAAATA ACCAGTTAGG AGGAATTTTA
ACTTGGAAAT TAAAAGGAGA AGATAGTTTA AGAGCAAGTG GAATTCCTTA CATAATTATT
AGACCCTGCG CTTTAACTGA GGCAGATGGA GGAAAAGAGT TAATATTTGA ACAAGGGGAT
AATATCAGAG GGAAAATTAG CCGGAATGAT GTGGCGGAAA TTTGCGTTCG ATCTCTAAAA
CAACCAAAAG CACGTAATAT AACTGTGGAA GTAAAAGAGG GAGAAAATAA TCCTAGTTCT
ATCAATTGGG AACATTTATT TTCTAAATTA AAATCTGATG AATAA
 
Protein sequence
MTEENRSPWD LYRFIKTLTY FEVFPVLNWL QKLFQGRPAN HQNPPNRGRK MGIILVAGAT 
GGVGKRVVKR LLTQGYKVRC LVRDIDKGRS IIGNEVDLVV GDITKPETLN SLVMSNIQAV
VCCTAVRVQP VEGDTPDRAK YNQGVKFYLP ETVGDTPENV EYNGVKNLVE AAVKYLPNTG
EKGIFDFTQS SQELKDIWGA LDDVVMGGVS ASNFQILEKT ALFAGNVSTA NSGGFASVRT
KSFSPAIDLS GYAGVKLRVK GDGQRYKIFL RTESIWDGVG YSYSFDTVAN TWIDITIPFA
NLTPVFRAKS VKNCPQIDAS KICSFQLMLS KFEYDGALNP KFNTGRFTLE LESIKAYGGE
TLPQFVLVSS AGVTRPGRPG INLDEEPPTV RLNNQLGGIL TWKLKGEDSL RASGIPYIII
RPCALTEADG GKELIFEQGD NIRGKISRND VAEICVRSLK QPKARNITVE VKEGENNPSS
INWEHLFSKL KSDE