Gene Haur_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1039 
Symbol 
ID5732943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1185827 
End bp1186867 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content51% 
IMG OID641278174 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001543815 
Protein GI159897568 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG CAATTATTGG AACCGGTTGG GGTGCTCGCG TTCAGGTGCC AGCCTTTCGT 
TCGGCTGGGC TGAAAATCGT GGGGATCGCC GCCCAAAATT ATGAAAAAAC TCAGCGTGAA
GCTGCCACTT TGAATGTTGA AGCCTTTGAA CATTGGCGTG ATTTGCTCAG CAGCGATGCC
GATTTGATTT CGATTGTGAC CCCGCCAGGG ACGCATTGCG AAATCAGCGT AGCGGCCTTA
GAAGCTGGCA AGCATGTGTT GTGCGAAAAA CCAACAGCAT TAAATGTGCT CGAAGCCCAA
ACCATGCTCG AAGCCGCCCA AGCCCATCCT GAACAATTAA GTTTGATCGA TCATGAATTA
CGCTTTTTAC CAATTTTTCA AATGGCGCGG GCGTTGATTA ATGATGGTGC GATCGGCCAG
ATTCGCCATG TCAATAGCAG CGTGATCTTC TCGTCGCGAG CTGACCCGCA ACGTCCTTGG
AACTGGTGGA GTGATAAAGA GCAAGCTGGT GGTGCTTGGG GTGCGATTGG CTCACACCAA
ATTGATATGT TGCGCTGGTT GTGTGGCGAT TTTAGCTCAA TTCGCGCAAG CTTGCACACC
TTTGTAACTG AACGACCACT CGACGATCAA CTCTTGCCTG TCACCAGTGA TGATTTTGCC
ACGGCTCAAG TGCGTTTGGC GAATGGTGGT TTTGCCTCAA TTATGATTAG TGGCGTGGCG
GCACTCAACG AAAACGATCG TATGATTATT CATGGCGAAC ATGGCGCGAT CAAAATTGAA
GGCGCTCGTT TGTGGCATGC CGAGCGTGAT GGCGAGTGGC AAGAGCGCAC GCCTGCTCAT
ACGGTAGCGA TTCCAAGCGA AATTAGTGGT AACTTCCCAG TGGGAACGGT CTATCTTGGC
CATGCCTTGA AGGCCTACAG CCGTGGTCAG CTTGATGCGT TGGAGCAAGC CGCCACGTTT
AGCGATGGCT TGCTGACCCA AAGTTTGCTT GATGCTGCTC ATCGCTCCGA TGAAAATGAC
GGTGGCTGGA TTACGATCTA G
 
Protein sequence
MKIAIIGTGW GARVQVPAFR SAGLKIVGIA AQNYEKTQRE AATLNVEAFE HWRDLLSSDA 
DLISIVTPPG THCEISVAAL EAGKHVLCEK PTALNVLEAQ TMLEAAQAHP EQLSLIDHEL
RFLPIFQMAR ALINDGAIGQ IRHVNSSVIF SSRADPQRPW NWWSDKEQAG GAWGAIGSHQ
IDMLRWLCGD FSSIRASLHT FVTERPLDDQ LLPVTSDDFA TAQVRLANGG FASIMISGVA
ALNENDRMII HGEHGAIKIE GARLWHAERD GEWQERTPAH TVAIPSEISG NFPVGTVYLG
HALKAYSRGQ LDALEQAATF SDGLLTQSLL DAAHRSDEND GGWITI