Gene Haur_4693 details


Gene Information

Locus tag: Haur_4693
Symbol:
ID: 5736540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5995222
End bp: 5996808
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 46%
IMG OID: 641281857
Product: leucine-rich repeat-containing protein
Protein accession: YP_001547452
Protein GI: 159901205
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
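
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5995222
end_bp = 5996808

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; assume the last codon is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp;", codons, "codons;", protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1587 bp) and Protein Length (528 aa) fields.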


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAGC AACTTTGCCG GGCGCGGATT GCCCAAAATG CCCAAACCCG TGAACCAACC 
CTCGATCTTT CATCACTCAA CCTCACCAAC CTACCAGAAA CGATTGGCGA ATTAACTCAT
CTTGAAGCAT TAAATCTTGC CTGCAATCGT CCGCTGCAAC TACCGCCTGA ACTTGCAAAC
CTAACCAAGT TGCGCAAATT GGATCTCAGC TTTCCCCACC AATCTATAAT TCCAGCATGG
CTCGATCAAT TAACCAGCCT CGAAGAATTA GATATTCGGG CAAACCCGAC CACTGGCATT
CCCGAGGTTT TAACGCGATT ACCACGTTTG CAGAAACTTA ATCTTTATCT TGATGGATTT
GAAGCATTGC CGAGCGAATT GCTTAATCTA TCAACGCTAC ACAATATTAC GATTGGTTCA
ACAAAACTAA CCAGATTACC TGATTGGTTC AGTGATTTGC GCATCACAGC TTTAGAATAC
TATCTCAATT CTATTCCTAA CGAGCACTTA ATTGGGGCGA TAGACTCGCT TCAAGTGCTG
GATCTACAGC TTTACTATGG GAAACCTACA GCTTTTCCCG CATGGCTGAG GCAGATGCAT
CATTTACGCT GGCTTCGTTA TAACAGCCAA GCACTGGTGC CACCTTGGTT AATCGAATTA
CCGCAACTCT CGTATTTGGA AAGCAACGCC GATTTTAGCG AAATCAGCCA AGTTTGGCAT
TCATGGGAAT CACTTGAACA GCTCAAGTGT GGTTATGTTG ACGCGACGAC CTTGCCACCA
AACCTCAAAA CCTTAGAAAT ACCCATCAGC GGATCAGCAA TTCCTGAATC AATTCGCCAA
GTGCGCCAGC TTGAAGTGCT TCATTTATCG GGTAAAGGGT TTCGCGAGTT GCCTGGCTGG
GTTTTAGCAT TGCCCAACTT GCACACCTTG GATCTTATCA GCACAGAGAT TGATTACATC
CTAGCGCCTG ACCAGCCGAA CAATAGCCTA CGAAAACTTA TGATGCATAC CCTCTACTGT
GGTCGCAATC ACCGCTTGGA TGGTCTACGC AGCCTGCATA GGCTGGAAGA ATTAAGCCTG
AGTAATCATC GTCTCGGCCA GCTACCCGCA TGGCTCTCCG AATTGCAGCA TTTACGCGAG
TTGTCGATTG ATGATTGCGA GTTGACCGAT CTTGATCCAA GTTTGGGACA ACTTCATCAG
CTTGAAGCAC TCTATCTTCA TGGCAATGCT ATTCCGGTCG CGAGCTTAGA GTTAATGTTC
CCCCGATTAA CCAAACTCCA GCATTTATCA TTTGGGGTCG CAAACGATGA GTCATTTCCC
GCTAGCCTTC GCCAATTGCA TCAACTGCGT AGCTTGTATC TGAGAATCGG GCCAGAGCAC
AGCATCCCTG AATGGTTGAA TCAATTGACC AAACTCGAAA GCATTATGCT TGGCTATAAC
ATTCAGCCAA CACAGATCCC TTGGATCGAA GGCTGGCTGG CACTGCCGAA ATTACGCGAG
ATCGATATTC ATATCAAGCC AGAATTGTTT GATCCTGAGT TACTACAACG CTTTACCCAA
CGCGGCGTAA AAGTCAATCT TGGCTAA
 
Protein sequence
MSEQLCRARI AQNAQTREPT LDLSSLNLTN LPETIGELTH LEALNLACNR PLQLPPELAN 
LTKLRKLDLS FPHQSIIPAW LDQLTSLEEL DIRANPTTGI PEVLTRLPRL QKLNLYLDGF
EALPSELLNL STLHNITIGS TKLTRLPDWF SDLRITALEY YLNSIPNEHL IGAIDSLQVL
DLQLYYGKPT AFPAWLRQMH HLRWLRYNSQ ALVPPWLIEL PQLSYLESNA DFSEISQVWH
SWESLEQLKC GYVDATTLPP NLKTLEIPIS GSAIPESIRQ VRQLEVLHLS GKGFRELPGW
VLALPNLHTL DLISTEIDYI LAPDQPNNSL RKLMMHTLYC GRNHRLDGLR SLHRLEELSL
SNHRLGQLPA WLSELQHLRE LSIDDCELTD LDPSLGQLHQ LEALYLHGNA IPVASLELMF
PRLTKLQHLS FGVANDESFP ASLRQLHQLR SLYLRIGPEH SIPEWLNQLT KLESIMLGYN
IQPTQIPWIE GWLALPKLRE IDIHIKPELF DPELLQRFTQ RGVKVNLG