Gene Haur_0017 details


Gene Information

Locus tag: Haur_0017
Symbol:
ID: 5736851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 20847
End bp: 21971
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 49%
IMG OID: 641277138
Product: hypothetical protein
Protein accession: YP_001542797
Protein GI: 159896550
COG category: [S] Function unknown
COG ID: [COG2311] Predicted membrane protein
TIGRFAM ID:
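
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 20847, 21971

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1125 374"
```

Both results match the recorded Gene Length (1125 bp) and Protein Length (374 aa).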


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATTG TATCCCGTTC CAAGACCACA AATGTGGTCA AAGATTCGCA ACGGATTGTT 
GGTTATGACG TAGCACGAGC CTTGGCAATT TTAGCGATGG TGATTGTGCA CTTCGCATTA
ACCTTCGTCA ACATCGAACA ACCCACAACC AATGGCCTGC TTAGCTTGAT TGTGCAGTTT
TGCGAGGGTC GCGCGGCGGC CTTGTTTGTC GTTTTGGCAG GGGTTGGCTT GAGTTTGATC
GCCCGTACCC AAGCTAATCC TAGCTCTGAG GCAATTCAAG CCAAACGCTG GACGTTGACC
AAACGTGGCT TATTTCTGCT TGGCTTGGGC TTGCTCAACC TAACAATCTG GCCAGGCGAT
ATTTTGCGGG TGTATGGGGT TTCTTTTTTG CTGATTGCTT GGCTGTTTCA AAGCTCAAAT
CGGCGAATTT TGGGCTTGGC CTTAGGCTTT ATGCTGGCCT TTGTTGGCCT GATGTTGCGC
TTCAATTTCA ACCAAAACTG GGATTGGACA ACCCTCGAAT ACACTAATTT ATGGACGATC
AAAGGCGGAT TGCGCAATCT ATTTTTTGAT GGCTTCCGCA GTGTGTTCCC TTGGACTAGC
CTGATTTTCT TTGGTATCTG GCTTGGCCGC CAAAACGTGC AAGCTGCGCA CGTGCGTTGG
CGCTTATTTT GGATTGGGCT AAGTGTGGCG CTGGGCGTAC AAATTGGATC GATTGGATTA
ACCTACGTTT TTAGCAACGT TTGGCCAATA TTGGGTCAGG CAGATGCCGA ATTGTTGTTT
AGCACTGGCT CAATTCCACC AATGCCCTTA TTTTTGCTCT CGGCTGGTGG TGTGGCCTTG
GCAATCATCA TGAGTTGTGT ACAGCTAAGT CAATTATTTG GTGCTAGCCG CATCATTCAT
GGTTTGGCGG CAACTGGCCA ATTGGCGCTG ACCTGGTATA TCGGCCATGT GGTAATTGGC
TTGGGTGTGT TGACCAGCCT TGGTTTTTAT CAGAATCAGA GCCTTGCAAC GTCGCTCTGG
TTGGCCTTAG GCTTCTTTGG ATTAGCGGTT GGCTGCTCGG TTTGGTGGAA AAAACGCTTC
AATAATGGCC CATTGGAAAC AGTGCTGCGT TGGGCCACCA GCTAA
 
Protein sequence
MAIVSRSKTT NVVKDSQRIV GYDVARALAI LAMVIVHFAL TFVNIEQPTT NGLLSLIVQF 
CEGRAAALFV VLAGVGLSLI ARTQANPSSE AIQAKRWTLT KRGLFLLGLG LLNLTIWPGD
ILRVYGVSFL LIAWLFQSSN RRILGLALGF MLAFVGLMLR FNFNQNWDWT TLEYTNLWTI
KGGLRNLFFD GFRSVFPWTS LIFFGIWLGR QNVQAAHVRW RLFWIGLSVA LGVQIGSIGL
TYVFSNVWPI LGQADAELLF STGSIPPMPL FLLSAGGVAL AIIMSCVQLS QLFGASRIIH
GLAATGQLAL TWYIGHVVIG LGVLTSLGFY QNQSLATSLW LALGFFGLAV GCSVWWKKRF
NNGPLETVLR WATS