Gene Haur_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2056 
Symbol 
ID5733944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2566000 
End bp2567679 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content57% 
IMG OID641279198 
Producthypothetical protein 
Protein accessionYP_001544825 
Protein GI159898578 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.809448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGT CTTCGTTTGA TCCGTTCGAT GGTCGCCTGT ATCCACGGCG TACCCCATCC 
CAGACCATGC TTGAGTATCA GCATTTAGAT CTTGCGGTTG TGCCATCGCC GCTGTGCCCG
CAGCGTCCGA TTACCTCTCT TGATCTTTCC CATAATCCGC TGGTCACATG GCCGAGCGAA
ACGGCTGCAT TGCCATCCCT CACCCGATTA AATCTTGCCC ATACCACGCT CACTCAGCTT
CCCGACCATC TTCGCGCGTG TACGCAGCTG GAAGAACTCT ATCTGTCAGG GTGTCCGCTG
GAATGTCTGC CGTCGTGGCT TAGCGAACTT CCCCACCTGC GCGTGCTTGA TCTGTCGCAT
ACGCGCCTCA CCATGGTGCC TGATGTCGTG CGTTCGCTGC CATCCCTCCA GGTGCTCAGT
CTTTCCGGGC TTCCCCTTGC AGCCCTGCCG CCGTGGCTCG ATGCGTGCGC CCTCCACATG
CTGTTCCTGC GGTCATTGAC GGCATGTGAC TTAAGCAGGG TGCGGGCTTG TTCGACGCTC
GAATACCTTG ATCTCGGACA CCTGGACTTG ACCCAGGTTC CCGACTGGAT TCAGGGATTA
CCCCGATTAC AGCAGCTGGA TCTCTCGGAC AATCCGATCA CGGAGCTTCC CGCGTGGGTT
GGCGATCTGC CGCTCACGAC GCTCCATCTT GCCCAGACGC GACTTCAGCA CCGTCCCGAT
TGGGAGGCAT GGACGATGCT GCGTGACCTG AACCTCAGCG GAATGACCCA TGATCCTGCC
GTCTTTGCGG GGGCATTTCC CGCATCCTTG ACAAGCCTGA AGCTGTACGA CACGGCCTTA
ACCGCAATCC CTCCCTTCGT TCGCAACCTC CAGCATCTTG AAACGCTCCG GTTTGACAAC
AATGCATCCT TGTCGCTTCC AGCATGGCTC CTGGAGGAGT GCCCATTAAA AACGCTCGAA
CTGATCAACA CCCACATCAC CGAAATTGCC CCGGTCGCGC AGCCCATCGC CTTAGAACAC
CTGATCATCA CGGCTGGCCG TCTGCCCACG TGGCCGACGC TCCTTGACTA TACGCCACAC
CTGCGGACAC TCGATTTGTC GGAAACGCGG ATCGTCGATG CCACCTGTCC ATCGCCGTGT
GTGCTTCCCC GATTAGTAAC GCTCGATCTT CAAGGCGATG CGATCGCGCA GCTGCTCCCG
CAGCTGGTCG TTCCCATGCT GCAACGATTG ACCATCGCCA ACTGTTGGGA CGCAGACCTA
ACCGCCGTGC TTCAGCAGGT CGGTCAGGTG AAGAATCTTG CCATCTTAAA CTGTTCAGGG
ACGGTACCGG AGGGGCTGCG ATCATGGACC CACCTCCAAA CACTGAATAT GGGTCATAAT
GGGTTGCGTG AGCTACCACG TTGGATCAGC GAATTGGAAC ACCTTGAATC GCTCAACCTC
GCCTATAATG ATCTTGCACG ACTCCCGCTC GCCGTGCGGG AGCTTTCGCA GCTACATACG
CTCGATATCA CGGCGAATCC GCTGCGGAGC TTTCCTGATT GGCTCCATAC CATGCCACAG
CTGCATGCTA TCGAGTTTCA ATTTCCACCG GATGACCTCA CCCTGCATGA TCATCAATTG
CAATTTCTGG CCGCTGGAGT GCGCTGCAAT GTCCGTTCAC CGCGACCACG GAAAGCTTAA
 
Protein sequence
MMPSSFDPFD GRLYPRRTPS QTMLEYQHLD LAVVPSPLCP QRPITSLDLS HNPLVTWPSE 
TAALPSLTRL NLAHTTLTQL PDHLRACTQL EELYLSGCPL ECLPSWLSEL PHLRVLDLSH
TRLTMVPDVV RSLPSLQVLS LSGLPLAALP PWLDACALHM LFLRSLTACD LSRVRACSTL
EYLDLGHLDL TQVPDWIQGL PRLQQLDLSD NPITELPAWV GDLPLTTLHL AQTRLQHRPD
WEAWTMLRDL NLSGMTHDPA VFAGAFPASL TSLKLYDTAL TAIPPFVRNL QHLETLRFDN
NASLSLPAWL LEECPLKTLE LINTHITEIA PVAQPIALEH LIITAGRLPT WPTLLDYTPH
LRTLDLSETR IVDATCPSPC VLPRLVTLDL QGDAIAQLLP QLVVPMLQRL TIANCWDADL
TAVLQQVGQV KNLAILNCSG TVPEGLRSWT HLQTLNMGHN GLRELPRWIS ELEHLESLNL
AYNDLARLPL AVRELSQLHT LDITANPLRS FPDWLHTMPQ LHAIEFQFPP DDLTLHDHQL
QFLAAGVRCN VRSPRPRKA