Gene Aazo_2071 details


Gene Information

Locus tag: Aazo_2071
Symbol:
ID: 9339865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2154299
End bp: 2155483
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: 4Fe-4S ferredoxin iron-sulfur-binding domain-containing protein
Protein accession: YP_003721242
Protein GI: 298491065
COG category:
COG ID:
TIGRFAM ID:
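
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that matches the reported numbers) and using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue,
    minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2154299, 2155483
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both results agree with the Gene Length and Protein Length fields of the record.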


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTAATC TGTTAGCTTT ACAATCCCTA AAACAAGGTC ACTGGTTCAA GTTGATTTGT 
GGAGCCAGTT TCCAACATCT ACCTGCAGTT AGAAGTTTAA CCTTAGCCTA CACTTTGGCG
GGTGCTGACT GCATAGACGT GGCAGCTGAT CCGGCGGTAG TTGTAGCCAC TCAAGCCGGT
TTACAAGCAG CTAAGGATCT GGCTAGGGAA GCTCAAAAGC AAGGCTTTGG TTTTAAAGGT
GATTTGCCTT TACTCATGGT CAGCCTCAAC GACGGAGAAG ATCCGCATTT TAGAAAAGCA
GAATTTAATG CTAGTAGTTG TCCGGTAGAT TGCCCTAGAC CCTGTGAAAA AATTTGTCCA
GCACAAGCTA TTGTGTTTAA CAATATAAAA GATAGCTTTT CAGGAATTAT TGCTGAAAAA
TGTTATGGCT GCGGTCGTTG CATTCCAGTT TGTCCTTATG AGATGATAAA TACAACATCT
TATATATCAA CTCCGGAAGC TATAGCACCA TTGATCATGT CAAAGGGAAT AGATGCCATA
GAAATTCATA CAAAAGTAGG GCGTTTGGCA GAATTCCAGC GTTTGTGGGT AGCGATCGCT
CCTTGGGCAG ATAAATTAAA GTTAATAGCT ATCAGCTGTA ACGATGGTAA AGGGCTGATT
GATTATCTCC AAGCAATTTA TGACCTGATC ATCCCCCATC CCGAAATTCT AATTTGGCAA
ACAGACGGGC GCTCTATGAG TGGTGATATC GGCAATGGCA CTACTATAGC AGGAATCAAA
CTAGGGCAAA AAGTTTTGGC AGCAAATCTA CCAGGATATG TGCAGTTAGC AGGGGGTACT
AATAGCTACA CTGTTCCTAA ATTAAAGTCC ATTGGATTGC TGAAAGAGTC AGGGGAGCAT
GGAGCGGGGG GTAGGGAGCA GCGGGCAGGG AGTAGCCAAC AGGGAGCAGC CGACAAGGGG
GAAAATACCT CTCCTCTCCA ACGCCAGTCG CCTGCCTTTA TCTCCGGTGT TGCTTACGGT
AGCTATGCCC GTGTATTGCT GTCACCGATT CTTGAACAGT TAGAAAATAA AGAGGTAATT
ACCACCAGTG TTAAAGCAAA TCTACGCCTC GAAGATGAAC CAGAACTACT ATGGCAAGCT
GTGAAACTTG CTCATTCTCT CGTTTCCCAG ATTAAGTCAC AGTAA
 
Protein sequence
MTNLLALQSL KQGHWFKLIC GASFQHLPAV RSLTLAYTLA GADCIDVAAD PAVVVATQAG 
LQAAKDLARE AQKQGFGFKG DLPLLMVSLN DGEDPHFRKA EFNASSCPVD CPRPCEKICP
AQAIVFNNIK DSFSGIIAEK CYGCGRCIPV CPYEMINTTS YISTPEAIAP LIMSKGIDAI
EIHTKVGRLA EFQRLWVAIA PWADKLKLIA ISCNDGKGLI DYLQAIYDLI IPHPEILIWQ
TDGRSMSGDI GNGTTIAGIK LGQKVLAANL PGYVQLAGGT NSYTVPKLKS IGLLKESGEH
GAGGREQRAG SSQQGAADKG ENTSPLQRQS PAFISGVAYG SYARVLLSPI LEQLENKEVI
TTSVKANLRL EDEPELLWQA VKLAHSLVSQ IKSQ
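
The relationship between the two sequences can be checked on the first ten codons. The sketch below is a simplified illustration, not a full translation tool: it assumes that translation table 11 uses the standard codon assignments and that the initiation codon (here GTG) is read as methionine. The function and variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 shares these
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment that begins at the start codon.

    Whitespace is ignored. The first codon is reported as M (assumption:
    the fragment starts at the initiation codon). Translation stops at
    the first stop codon, which is not included in the output.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
first_codons = "GTGACTAATC TGTTAGCTTT ACAATCCCTA"
print(translate_cds(first_codons))  # MTNLLALQSL
```

The output matches the first block of the protein sequence, MTNLLALQSL.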