Gene Haur_4431 details

Gene Information

Locus tag: Haur_4431
Symbol:
ID: 5736282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5670318
End bp: 5672081
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 53%
IMG OID: 641281594
Product: hypothetical protein
Protein accession: YP_001547191
Protein GI: 159900944
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
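
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5670318
END_BP = 5672081
GENE_LENGTH_BP = 1764
PROTEIN_LENGTH_AA = 587


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.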


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGCC ACGATCAGCG TGCAGAGCGC AAAAGTATTT GGCACGATTT ATTAGAACGT 
CGGATCAATC GCCGGACCCT TGTTGCCAGT GGTGCTGCCG CTGCTGCGGT CGCGGCCTTG
CCCTTAGATC TGCAAACTGC TGAAGCGGCG CACTATCATG CTCCATTGTC AGCCCCAGCC
TTGGCTCAAC GCCAAGCTCA AGGTTCATTG CCCTTCAAAC CAATCAGCCC CAGCACCGCC
GACGATTTGA TTTTGCCCGA AGGCTTCCGC TACGATTTGT TGGCCCACCG CGGCCATGAT
ATGGGCGATG GCAGTTTGTT TGGCGAAAAT GCCGATTTCC TGGCGTTCTT CCCAATCGAT
ATGCTCCAAA AAGGCCTCGA CCAAAATCGC CCACAATTTG GCTTTACCCG CAGCGATTTA
TCCAGCACTG ATGGTTTGTT GCTGGTCAAC CACGAATATA TCAACCCTAT GTTTATCTCG
GGCTACACTG GCTCAGGCGC AAAATCTGGC GATCAAATTA ACGCCGAAAA GCATATGGTT
GGCATGAGCG TGATTCGGGT TAAGCGCAAT AGCGATGGCC GTTGGTATTT CGACCAAACT
GATACCGCTC ACAACCGCCG CATCGATGCA ACTACTCCAA TCACCTTAAC TGGCCCAGCC
GCGCAACTTG ATGGTGGCCC GATGGCAATT GGCTCACTTG GCAATTGTTC CGGTGGTGTA
ACACCTTGGG GCACAGCACT GAGCTGCGAA GAAAACTTCC AAGATTATCC AAATCCAGCA
CCAACTGGCT ATGGCTGGGA ACCAGAAATC TACGGCAAGC GCCACTACGG TTGGGTCGTC
GAAGTTGATC CATTCGATAA AAACAGCATG CCACGCAAAC ATACCGCCAT GGGTCGCTTC
CGCCACGAAA ATGTAGCGGT ACGAGTTGGC AGCGACGGCA CGGTTGTAGC CTATATGGGC
GATGACAAAG CCGATTCATG CGTCTATAAG TTTGTGGCTG ACCGCAAATT GACCAACTTG
GCAGATCGCC CAGGCAATAT GCAAATTCTC GAAAGCGGCC AACTCTATGC CGCCGACTTT
GCCAATGGCA AGTGGATTTT GCTCGATTAC AATAGCCAAA GCGCCTTGCA AAGTGCCAAA
GATAGCAAAG GCAATTTGCT ATTTAGCTCG CAAGCCGATG TTTTGGCCGA TACCCAAGCC
GCTGCCATGG CGCTCAAAGC CACGCCCGTT GATCGCCCAG AAGATATTGA AATTCACCCA
CTCGATGGCA GTGTCTATGT TGCCTTGACC AATAATACTG GCCACGGCAA CTTCCACGGC
CAAATCGTGC GCATGGCCGA AACCGACAAT AATCCAGCTG CAACCAGCTT CGAATGGAGC
ATCTTCGCGG TTGGTGGCTC GCAAAGCGGC TTCTCATCGC CCGACAATTT GGTGTTCGAT
GGCGAAGGCA ACTTGTGGAT GGTAACCGAC ATCTCATCAT CACGCACCAA CAAAGGGATC
TACAAATTCC AAGGCAACAA CGGTCTCTTC TTCTTCCGCA CCAGCGGCCC TGATGCTGGG
ATCGCCTTCC AATTTGCCTC CGGGCCAGTG GAAAGCGAAA TGACTGGGCC ATGCTGGTCG
CCTGATGGCC GAACCCTGTT CTTGGCGATT CAACACCCAG GTGAAGAATC CAAGAGCTTG
ACCGAACTGA GCAGCCACTG GCCAATTGGT GGTAACGAAG TGCCGCGCTC AGGGGTTGTC
GCAATTACCG GGTTCAAGCG CTAG
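
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical. Only the first displayed line is used here, so the value it prints need not match the GC content listed for the whole gene.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from above.
first_line = "ATGACCAGCC ACGATCAGCG TGCAGAGCGC AAAAGTATTT GGCACGATTT ATTAGAACGT"
seq = normalize_sequence(first_line)

if __name__ == "__main__":
    print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

To reproduce a whole-gene figure, pass the complete multi-line sequence text to `normalize_sequence` instead of a single line.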
 
Protein sequence
MTSHDQRAER KSIWHDLLER RINRRTLVAS GAAAAAVAAL PLDLQTAEAA HYHAPLSAPA 
LAQRQAQGSL PFKPISPSTA DDLILPEGFR YDLLAHRGHD MGDGSLFGEN ADFLAFFPID
MLQKGLDQNR PQFGFTRSDL SSTDGLLLVN HEYINPMFIS GYTGSGAKSG DQINAEKHMV
GMSVIRVKRN SDGRWYFDQT DTAHNRRIDA TTPITLTGPA AQLDGGPMAI GSLGNCSGGV
TPWGTALSCE ENFQDYPNPA PTGYGWEPEI YGKRHYGWVV EVDPFDKNSM PRKHTAMGRF
RHENVAVRVG SDGTVVAYMG DDKADSCVYK FVADRKLTNL ADRPGNMQIL ESGQLYAADF
ANGKWILLDY NSQSALQSAK DSKGNLLFSS QADVLADTQA AAMALKATPV DRPEDIEIHP
LDGSVYVALT NNTGHGNFHG QIVRMAETDN NPAATSFEWS IFAVGGSQSG FSSPDNLVFD
GEGNLWMVTD ISSSRTNKGI YKFQGNNGLF FFRTSGPDAG IAFQFASGPV ESEMTGPCWS
PDGRTLFLAI QHPGEESKSL TELSSHWPIG GNEVPRSGVV AITGFKR
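
As a consistency check between the two sequences, the start and end of the gene sequence can be translated and compared with the start and end of the protein sequence. This is a sketch, not the database's own procedure: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it stops at the first stop codon. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final displayed line of the gene sequence above.
gene_start = "ATGACCAGCC ACGATCAGCG TGCAGAGCGC"
gene_end = "GCAATTACCG GGTTCAAGCG CTAG"

if __name__ == "__main__":
    print(translate(gene_start))  # compare with the first 10 residues above
    print(translate(gene_end))    # compare with the last 7 residues above
```

The final displayed line begins in frame because the 29 preceding lines hold 60 bases each, and 1740 is a multiple of 3.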