Gene Haur_5261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5261 
Symbol 
ID5737219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp34321 
End bp35997 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content49% 
IMG OID641282425 
Productleucine-rich repeat-containing protein 
Protein accessionYP_001548016 
Protein GI159901771 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCAGA TTCATGCTGG GTATATTATG CCCGAGGATG CGACGCAGAC CACCCTTGAT 
TTTAGCCGCT TAAGCCTCAC AATCCTTCCG ACGATGCAGG ATGTTTCTTG GAATATTACG
GCAATCGATT TATCCTATAA TAGCTTAACG ATGCTCCCTT ATGCGCTTCC GCGTGCGGCA
TCGCTGAAAC GCCTTCTGTT ACGTGTTAAT CCGCTCACTG CGCTTCCTGA ATGCATCCGC
GAATGCCACA ACCTTGAAGA ACTCTATGTC TCAGGCTGTC CCTTGACCAT GTTGCCAGAT
TGGTTGGATG AACTAACGGC CTTGCGAATA CTGGAGATCA GTGACACCGC CATTCCTGAT
TGTCCCTCGG TATTGCGTCG ATTGCCGAAT CTTCGGATGC TTGGCATCGC AAATCTCCCA
TGGACAACCC TACCGCCATG GTTTTGTGAC TTACCGCTGA CAACCCTCAC TATCGATGGC
ATGCCTAGAT GTGATTGTTC TCCCCTCGTA GGATTAAAAC AGCTTCAGCA TCTTGGCCTC
AGTGCCATGG ACTATACCAT TGTGCCTGAA TGGATTCGAC AATTGCCTTT GTTACACCTG
CTTGATCTCA GTCATAATCC CCTTGAAATA CTCCCATCTT GGCTAGAAAC CATCCCTATA
ACAACCCTCA TGCTTGCCCA TGTTCCATTA GCGGCGCTTC CCGATTGGCA TACGTGGGAT
CGGTTAACCA CACTCGATTT AACGGCATGT CATTTGGCTG ATGCTGCCTT TCGAATGCCC
TTGCCGCGCA ATTTAACCCA CCTGAATCTT GATCAAAATC CCATTACTCA GCTTCCATCA
GAGCTTTACC AGTGTCATTC ACTCTACGCT TGTAGTCTTG CGAATACCGC TCTAACCACC
CTTCCAGCAT GGTTTTTTGA AGATCTTCCA CTTGTATCAC TCGATATTTC AGGCACCAGC
CTCACGTTTC CGGCGCTCTC CCAGCGATCG ATGCTGGAAT CATTCATCTT TGGCATGGGG
AAAACGTCCG CTTGGCCATA CCTTCTCACA CACATGCCGA CATTACGCGT GCTTGATCTT
TCTGATACGT GGTTGCAGTC TCAAAGTCCA TCTATGGGTC ACGCGCTCTT TCCGCAGCTT
GAGACATTTC GTGGACCACG CGATCAAGAC CGTGTTCCTT TAATAGGAGC TATGCCAAAT
CTGCGTGTGG CAGTGTTAAG TGGTGGATTA TCACGAGGTT CGCGTGAGTA TCTTTCTGCA
CTCTTAGGAA AGAGTCCTCA GATCCAAGCA CTTGACCTTT CGCGTTGGCA TTGCAATCCG
ATCCCTTCTA CCCTTGTCGA TCTTGCCGAG TTGCAGACCT TGAATATTGC GCATAAGCAC
CTCGATCAGG TTCCTGGATG GGTGAACGAT ATGCCATACC TTAAATCACT GGATCTTTCT
GATAACCGAT GTACTGACAT ACCACGATGG ATGAGGAACA TGACACACCT TGAATCGCTT
GATCTTTCCG GGAATCCTTT ACAGACCTTT CCCTCATGGT TAAAGGATAT CCCAACGCTC
AGAGATGTCG CATTCATGTT TCCATCAGTC AACCTTCAGT GTGATCACGT CCTCCCAGAA
TTTCTGGCGG CAGGGATTCG CCTGGATGTC CAATATCCCC GTGATGACGC TGAATGA
 
Protein sequence
MYQIHAGYIM PEDATQTTLD FSRLSLTILP TMQDVSWNIT AIDLSYNSLT MLPYALPRAA 
SLKRLLLRVN PLTALPECIR ECHNLEELYV SGCPLTMLPD WLDELTALRI LEISDTAIPD
CPSVLRRLPN LRMLGIANLP WTTLPPWFCD LPLTTLTIDG MPRCDCSPLV GLKQLQHLGL
SAMDYTIVPE WIRQLPLLHL LDLSHNPLEI LPSWLETIPI TTLMLAHVPL AALPDWHTWD
RLTTLDLTAC HLADAAFRMP LPRNLTHLNL DQNPITQLPS ELYQCHSLYA CSLANTALTT
LPAWFFEDLP LVSLDISGTS LTFPALSQRS MLESFIFGMG KTSAWPYLLT HMPTLRVLDL
SDTWLQSQSP SMGHALFPQL ETFRGPRDQD RVPLIGAMPN LRVAVLSGGL SRGSREYLSA
LLGKSPQIQA LDLSRWHCNP IPSTLVDLAE LQTLNIAHKH LDQVPGWVND MPYLKSLDLS
DNRCTDIPRW MRNMTHLESL DLSGNPLQTF PSWLKDIPTL RDVAFMFPSV NLQCDHVLPE
FLAAGIRLDV QYPRDDAE