Gene Haur_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1083 
Symbol 
ID5732872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1239451 
End bp1240800 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content53% 
IMG OID641278221 
ProductUspA domain-containing protein 
Protein accessionYP_001543859 
Protein GI159897612 
COG category[T] Signal transduction mechanisms 
COG ID[COG0589] Universal stress protein UspA and related nucleotide-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000280681 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTACT CAATCGTGAT CATTGACCCA GATCAAGGTT CAGCCCAAAC AACTGCCGCG 
CGATTTCAGC GGCGTTGGGG CCAGAACGTC CAAGTCAATA TTGTTAAACA AGCTGATTGG
GCTAGTGTTC AGGCTCATAG CCCCAACCTA GTCGTGATTG ACCCCGCGCC ATATCGGCTG
CATGGGTTGC GCTTACTCGA ACAACTTTGC GAAGAGCAAC CATCAACAGC CATTGCCGTG
GTCGCTTCGG GTAGCTCCCC AAGCATGCGG CAACGCCTAC GCAACTTGCC AATCACTTCA
TATTTGGAAA AACCTAGCTC GTTAGCCCCC CTACTCGGTG AACTTGATCA CCTTGTTGAG
AGCAATGTCG CTCTCCAATC CAAAGGAGGA TTGGTTATGG AACGCCAAAT GTTGATTCCA
CTTGATGGTT CACTGTTAGC GGAACAAGCT TTGGATTATG CGGTGGTTTT AGCTCGCCGC
AACAGCAGTG TTTTGCATTT AGTTCGGGTG ATTGGCTATC CACCGCTGGT TCCGGCTTAC
GAATGGCCAG TGCCAAGCGC CGTTGATAGC CGTCAATGGT TAGCCGACGA ACGTCAAGCC
GCCCAAACCT ACCTTGATGA ACTTAAAGCT CGTTATGAGC AACAAGGCTT AGCGGTACGA
ACAACCGTGC TTGATGGCGA GCCAGCTCAT GCAATTGTCC AATTTGCGAC CGAACAAAAT
AGCGTGCGCG AAATTGTGCT CGCCAGTCAT GGTCGCAGTG GGCTTGGGCG TTGGGTACTC
GGTAGCATCG CGGAAAAATT AGTCCAAGCT ACGCCAGTGC CAATTTTGGT CATCCATGGC
GATGAACGCA AAGTCGAAAC CTTTAAAATT CCGCCAGAAT TGCGCACGAT TGTTGTGCCA
CTCGACGGCT CAGCCATTGC TGAACAAGCT TTGCCCTTGG CCAGCCAACT AGCCGAGGCC
CATAGTGCTG AACTAGTCTT GCTGAGTGTC ACCCCGGGCA TCGACGATCC TGGTTTGATC
GAATCGGGAC TTGTGCCAAT GTGGAGTGCT GGCGAAAAAG CTCAAGCCCG TGATCAAGCC
CAGAAATATT TGCAACAGCT TGAACAATCG TTGCAAACGC CACGGCTACG TCTCCGCCAT
CTCGTCGTGA GCGGCACACC CGCCGAAATG ATCGACGAAA TTGCCCAAGA AGCAGTTGCC
AGCATGATCG TTATGGCAAC CCATGGACGC AGTGGCTTCA GTCGCATGTG GATGGGCAGC
GTAGCAACCA AGCTCATTCG CAGCAGCCAA CGGCCAATCT TCTTGGTACG GGCGGTCGAA
CAAGCTGCCG AACGCGGCCA ACCACTCTAA
 
Protein sequence
MSYSIVIIDP DQGSAQTTAA RFQRRWGQNV QVNIVKQADW ASVQAHSPNL VVIDPAPYRL 
HGLRLLEQLC EEQPSTAIAV VASGSSPSMR QRLRNLPITS YLEKPSSLAP LLGELDHLVE
SNVALQSKGG LVMERQMLIP LDGSLLAEQA LDYAVVLARR NSSVLHLVRV IGYPPLVPAY
EWPVPSAVDS RQWLADERQA AQTYLDELKA RYEQQGLAVR TTVLDGEPAH AIVQFATEQN
SVREIVLASH GRSGLGRWVL GSIAEKLVQA TPVPILVIHG DERKVETFKI PPELRTIVVP
LDGSAIAEQA LPLASQLAEA HSAELVLLSV TPGIDDPGLI ESGLVPMWSA GEKAQARDQA
QKYLQQLEQS LQTPRLRLRH LVVSGTPAEM IDEIAQEAVA SMIVMATHGR SGFSRMWMGS
VATKLIRSSQ RPIFLVRAVE QAAERGQPL