Gene Haur_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0214 
Symbol 
ID5732109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp248414 
End bp249409 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content47% 
IMG OID641277338 
Productperiplasmic component of the Tol biopolymer transport system-like 
Protein accessionYP_001542994 
Protein GI159896747 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAC TTTTTATGTT TACAGCTTTT TTATTAATAG CTAGTGGATG TACTATGCAA 
ACACCTATGA CTCAGATCCC AACTACACCG ATTACACCAA CCCAATTAGC CGTCCCTGCG
GGAATTTATT CAGGGCTAGC TTGGCTTGAT CAAGGTCTAG TGCTACAAGC ACGAACAGCC
AACAATCCTG TTGAGAATTT GTATTGGATT GATCAACAAG GAAACCTTGG TGAATCATTA
TCAATTCCAC TTGAACAAGC ATTTGTTATG ACTTCATATT ATTTCCCCCA ACGCCTTCCT
AATGGCAAAC TTGGCCTGCG GCGCTATAAC TGGAATCCAA ATCTTGAGAC GGGCATTTCC
GAACGTGAGT TTGGTGTATG GCAATTCGAT CCAACAACCA ATGATCTTAC CCCACTTTTA
CAACCTGCTT TACCCCAAGA TCTCAGCCAA CATCTGCGGT TTAGCCTTGC TCCTGATATG
CAACGCGCGA TGCTCTCCGA TGGCGGCTAT CTTCAGTCAC GGCTCTTTTG GTGGTCGGCG
GAGGCAGGTC ATCAACCGCT TGATGCAGGG GTGGCTATCT GTCAATACTT TGCATGGTCA
CCCGATGGTA CGACCATTGC CTATGCTGGT TCACCGCATG CAGCCGATTC CATGGCAACC
TTAGGTGGAG TGCGCTCAAC ACTCTATTTG ATGGATAGTG ATGGTGGGAA TCGGCGCGAA
ATCGGCACGA ATATTCGCAA TGTTTCAGGG TTGCAATGGT CGCCCGATGG TCAATGGCTG
GTCGTTCTGG GCTACTTCGA TGGATTCGAT AATCAGGTCT GGTTGGTAAA TCCTACGAAA
GCTGAGTGGC ATCAACTCAC GACTACGGTT GGAAATTATC AATGGCCTGC ATGGTCACCC
GATGGCAAGC AGATCGCCGT CATTTGGCGT AAACCAGCTG ATCTTGGCCC CAGTGATTAT
GTGATGACGT TGGATGTTTC AGCGTTTATG CAATAA
 
Protein sequence
MKQLFMFTAF LLIASGCTMQ TPMTQIPTTP ITPTQLAVPA GIYSGLAWLD QGLVLQARTA 
NNPVENLYWI DQQGNLGESL SIPLEQAFVM TSYYFPQRLP NGKLGLRRYN WNPNLETGIS
EREFGVWQFD PTTNDLTPLL QPALPQDLSQ HLRFSLAPDM QRAMLSDGGY LQSRLFWWSA
EAGHQPLDAG VAICQYFAWS PDGTTIAYAG SPHAADSMAT LGGVRSTLYL MDSDGGNRRE
IGTNIRNVSG LQWSPDGQWL VVLGYFDGFD NQVWLVNPTK AEWHQLTTTV GNYQWPAWSP
DGKQIAVIWR KPADLGPSDY VMTLDVSAFM Q