Gene Haur_2190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2190 
Symbol 
ID5734077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2774370 
End bp2775638 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content45% 
IMG OID641279331 
Producthypothetical protein 
Protein accessionYP_001544958 
Protein GI159898711 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATAG TATCCCTACC AGCTCGTCAA TCGATTCAGC GTTTGCTCGG CGAGCATTTA 
AAAAAACGTC CAGGCCTGTG GCTCTTTATA AGCATGATGT TGGTTAATGC AGGGAATTAT
GGGTTTAATG TTTTGGTCGG GCGCTGGCTT GGGCCATCGG CTTACGCCGA AATTAATTTA
ATGATTACCT TGTTTTTGAT TGCAACCTTT ATTACAAGTG GCTTGCAGAT GAGTGCTGCT
CGAGCAATTG TTGGCTCAAA TAACCAGGCT ATCTTAACCA TGTTGCGACG GGTTGCTTGG
GTTTGTGGTG GCTTAAGCTT GCTCGGTTTG GGGCTTTTTG CAGCATTTTG GCAAACCCTG
TTTCACACAG CTAGTGCCAG CCCATTTATT ATCTTAGGGG TTGGGCTAGC CTGTTATTAT
GCCTTGGGCG TTGAGCGCGG TATTGCCCAA GGATGTGCGC ATTTTGGGCG CTTGGCTTGG
AATTTTCAGA TCGAGATGGC TGTTCGACTG ATGGGAGCAT GCGTCTTTGT TGGATTAGGT
ATGGGTGTCG CTGGTGCAAC GATCGCTTTA AGTAGTTCGA TTGTCATTGC ATGGTTGGAT
AGTAGGAGAT CTTATGCTAC GGTTGAGCAA CATAGAACGA ACAATTCTAT CCAGCTGCAA
ATGATGCTTC CGATTATCAT CCATCTTTTG GGTCAAGTTT TAATTAATAA TAGCGATGTG
TTGTTGGTCA AAGCATGGTT TCCGGCAATG ATTGCTGGCC AATATGCAGC CTTAGCGCTC
ATAGGACGGG TTGTGTTTTT TGCAACAGCC ACCTTAGGTA CCATTCTCTT TCCCCGAGTG
TTACGTTCTA CCCAACATAG CGAACAACAA CGACTTTTCT GGCAAAGTAT TATGATAACC
GTGGCGATTG CGGGACTGAT TACTTTGTTA TGTAAAATTA TACCGCAACT AATCCTTGGA
TGGCTTTTTG GTGAGGCCTA TCTTCCCATC GCAGATTTCT TATGGCTATA TGCGGTGGCA
ACGAGTTGTT ATGCCATTGC TAATATTAGC ATCACGTATC AGCTTGCTCA GGATCGTTCA
TTTGGTGCAT GGATTGCTCT GCTTGCAGGT AGCATTCAAA TCGGGGTAAT GAGCTGGTTC
AATCAGAGTA TTCACCAGAT TTTATATAGC CAAGTTGGAG TGATGCTTGG TTTACTATTG
GTGCTTATGC TCTACGAATT CTATAGGGCT CGTGCACGGC AGAAATTCTC AAAATCCAGG
GTAATTTGA
 
Protein sequence
MRIVSLPARQ SIQRLLGEHL KKRPGLWLFI SMMLVNAGNY GFNVLVGRWL GPSAYAEINL 
MITLFLIATF ITSGLQMSAA RAIVGSNNQA ILTMLRRVAW VCGGLSLLGL GLFAAFWQTL
FHTASASPFI ILGVGLACYY ALGVERGIAQ GCAHFGRLAW NFQIEMAVRL MGACVFVGLG
MGVAGATIAL SSSIVIAWLD SRRSYATVEQ HRTNNSIQLQ MMLPIIIHLL GQVLINNSDV
LLVKAWFPAM IAGQYAALAL IGRVVFFATA TLGTILFPRV LRSTQHSEQQ RLFWQSIMIT
VAIAGLITLL CKIIPQLILG WLFGEAYLPI ADFLWLYAVA TSCYAIANIS ITYQLAQDRS
FGAWIALLAG SIQIGVMSWF NQSIHQILYS QVGVMLGLLL VLMLYEFYRA RARQKFSKSR
VI