Gene Haur_3055 details


Gene Information

Locus tag: Haur_3055
Symbol:
ID: 5734927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3859317
End bp: 3861164
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 52%
IMG OID: 641280199
Product: hypothetical protein
Protein accession: YP_001545821
Protein GI: 159899574
COG category:
COG ID:
TIGRFAM ID:
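
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return cds_length_bp // 3 - 1


length = gene_length_bp(3859317, 3861164)
print(length)                     # 1848
print(protein_length_aa(length))  # 615
```

Both results agree with the Gene Length and Protein Length fields of this record.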


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCTA GGTCGATTAA TCGCCGGCTA TTGGTGCTTC AAGCAGGCTG GACAGTTGCT 
CAGGCCCAGT TATTGCTCGC CCATAGCCAA GCCGAATATG TGGTTGTTCA ACGCACTGAG
CCACAAATCT ACTGGTATGT CTACCCTCTC GAAGTTGTCC AAGAACGCTT AAGCTACCAT
CACCCTGATA TTGCGATCTA CCTGGCGCTC AATTTGCAAG AAACCTCAGC TAGTTCGACT
CTCAACAGCA ATCAACTCGA AACCGCCGAC TATTTAAGCG TGGTGCTTGA TGATCAGCAA
CACTTGCAAG GCGTAGTTAA TCCGAGTGCT CAAGCCAAAG GCACAGAATT TGATCCATTC
TTCAAGGCCT ATCCCTCAGT AGTTGCCCAA AACCAAGCCC AACTTAACCA AGCCTTTGAT
CTAGCGGTTG GCTTTCGCGA TACGCCTGAT GCTGGCTTAA TCGGTGGCCA TAACCCGATT
GTCATTCATG GCTTGCAAAC TGATGAACGT TGCACAATTA TGCTCAGCGG CGATGGCTTG
CAGTTTGATC GAGAGCAAGC CGAATTGGCC TTTGACATTC AGGCAACCGT ATTTTTTAAG
GCCACGCCAA CCCGCACAGG TCGCTGTGTA ATCTATGTCG ATTATTATCG CCAACGCCAA
TTGGTGGGCC ATGCCGAGCG GGTTGTGCTG GTCGATAGCA ACGCTGAACC AGAACCTAGC
AATGCTAGCC CGTTTGATTT TGGCTCAACT CCGGTCGATT TACTGATCAA CCTGCGCCGT
GATGGCGATA CGTTCAAATG GACGGCAATG CCGCATGATC AAGCCTTTAC ACCGGTGCAC
AATTTGCCGA GCCAACAAGC CTTATCCGAG CAGGCTGCCC AAAATTGTGC CGTTGATCTG
TTGGGTGCGG CGGTTAATCC AAGTTTATTG TTGGCGCAAC GCGAACTTGA AGCGCTTGCC
AGCGATCTAG GCCAATTTGT CCCAAGCCCA ATTTGGCAAT TACACAGCGA TTTGGCCCAG
AAATTGCAGC GTCCACTCAC GGTTTTGCTG CGCAGCAACG ATTTGGCTTT GCCTTGGGAA
TTAGCGATGG TCGAAGCCCC TTTGCTAGCT GGCGATCAGC CGCTGTATTG GGCCGCCCAA
ACCCATTTTG CCCGTTGGTA TATTCACCCC CAAGTTAGCC CAATGCCACC CGATCAACTC
AACATTAGCC AAATTAGCGC AATTGCCTCA CGCTATGGCT GGGATTCAGG CCAAGCTGAA
TTGGTGCATG CAGTTGATGA GCAAACCATG CTGCAAAACC AATGGCAAGC CCAAGCCTAC
GAAGCCACGA TTCAGGCGCT TGATCCATTG TTGAGCCAAG CCACGACCCA AGCTGGCCAT
CTTTTACACT TTGCCGTGCA TGGCCGCAGC CAACCCAACG CCCGCATTCA AGAAATTATC
TTGGCTGATA ATAATGCGAT TTCGGCCAAA GCCTTGGTGG GCAATACTCG CCGTCGCCCG
CCCCAATTTA GCTTTGTGTT TATCAATGCC TGCCAAGTTG CGACCCCAGG CCAGAGCTTA
GGCCAAGCGG CGGGCTTCCC CGCCGAAATT CTCAAAAGTG GTGCGGCGGG CTTTGTTGCA
CCATTGTGGG AAGCTGATGA TCAAGCAGCC GGAACGTTTG CCGCTCAATT TTATAGCCAA
GCGTTTCAAG CCCAACCGTT GGGCGCAATT TTGCAACAAT ATCGCCTAAG TTATGTGGCC
AATAGCACCA CCACCCGCCT TGCCTATATC TTTTATGGCC ATCCCGCCTT GCGTTTGGCC
TATTCGAGCA AAGGAGCAAC CCATGCCCAA CAACCAAGTG CGGCTTGA
 
Protein sequence
MSSRSINRRL LVLQAGWTVA QAQLLLAHSQ AEYVVVQRTE PQIYWYVYPL EVVQERLSYH 
HPDIAIYLAL NLQETSASST LNSNQLETAD YLSVVLDDQQ HLQGVVNPSA QAKGTEFDPF
FKAYPSVVAQ NQAQLNQAFD LAVGFRDTPD AGLIGGHNPI VIHGLQTDER CTIMLSGDGL
QFDREQAELA FDIQATVFFK ATPTRTGRCV IYVDYYRQRQ LVGHAERVVL VDSNAEPEPS
NASPFDFGST PVDLLINLRR DGDTFKWTAM PHDQAFTPVH NLPSQQALSE QAAQNCAVDL
LGAAVNPSLL LAQRELEALA SDLGQFVPSP IWQLHSDLAQ KLQRPLTVLL RSNDLALPWE
LAMVEAPLLA GDQPLYWAAQ THFARWYIHP QVSPMPPDQL NISQISAIAS RYGWDSGQAE
LVHAVDEQTM LQNQWQAQAY EATIQALDPL LSQATTQAGH LLHFAVHGRS QPNARIQEII
LADNNAISAK ALVGNTRRRP PQFSFVFINA CQVATPGQSL GQAAGFPAEI LKSGAAGFVA
PLWEADDQAA GTFAAQFYSQ AFQAQPLGAI LQQYRLSYVA NSTTTRLAYI FYGHPALRLA
YSSKGATHAQ QPSAA
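
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch in Python of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last printed lines of the gene sequence are used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGTCGTCTA GGTCGATTAA TCGCCGGCTA TTGGTGCTTC AAGCAGGCTG GACAGTTGCT"
last_line = "TATTCGAGCA AAGGAGCAAC CCATGCCCAA CAACCAAGTG CGGCTTGA"

print(join_blocks(first_line).startswith("ATG"))  # True: begins with a start codon
print(join_blocks(last_line).endswith("TGA"))     # True: ends with a stop codon
print(len(join_blocks(last_line)))                # 48
print(gc_fraction("ATGC"))                        # 0.5
```

Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.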