Gene Haur_4613 details


Gene Information

Locus tag: Haur_4613
Symbol:
ID: 5736460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5899069
End bp: 5900472
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 51%
IMG OID: 641281777
Product: type II secretion system protein E
Protein accession: YP_001547372
Protein GI: 159901125
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
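
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported 1404 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 5899069
END_BP = 5900472


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1404 467
```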


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTCC TAAAGCGTAT TGCAGGAAAT ACGTCCCCAT CCTCGGCTTC CGAACCTACA 
GCAGCAGCAC AATCGAGTGC TGCTGCCACA CGCCCTGATG GAGCTATTCC GCGCTCTGCT
TCAGTTTCTG CCCAAGATCG CCTACTTGAT GTTCGACATC GTGTCCAGCG ACGTTTGACT
GAAGAAGTTC GCGATGTTAA TAGCACAAGT GAAACCAAAA TCCGCCAAAC GGTCGAAGAT
CTCTTGAGTG TCGTCCTTGA TAGCGAAAAT ATCGTGTTAA GTCGGGTCGA ACGCCAACAA
TTGGTCGAAT CGTTGATGTC GGATATCGTT GGGCTTGGGC CGCTCGATTC GCTGCTCAAA
GATGAGAGCA TCTCGGAAAT TATGGTCAAT GGGCCAAACA AAATCTATAT CGAACAACGT
GGGAAGCTGA CGCTTTCTGG CACAACATTT ATCGATGATG AACACGCGAT GCGGGTGTTG
TATCGGATTG TGTCGCCTCT TGGCCGCCGG ATCGATGAAA GCTCGCCCAT GGTCGATGCC
CGGCTTCAAG ATGGCTCGCG GGTTAACGCA GTTATTCGGC CTATTTCATT GATTGGTCCA
GTCATCACGA TTCGGAAATT CTCCAAAAAG CCGCTTGGCC CCGAAGATCT GATTCGGTTT
GGGGCGATTA GTCGCGAAAT GATGGAGTTT CTTTCGGCCA GCGTTCGTGC TCGGATCAAT
GTGGTGGTTT CTGGTGGTAC CGGTTCGGGC AAAACGACCT TATTGAATGT GCTTTCCTCA
TTTATCCCTG AAGATGAACG TTTGATTACG GTTGAAAACG CTGCCGAACT TCAGCTCCAA
CAGGATCACG TGATTTCGCT CGAATCGCGG ACGGCCAATA TCGAAGGTAA GGGCGAAATT
TCAATCAACG ATTTGATTAT CAACTGCCTG CGGATGCGAC CTGAACGCAT TATCGTCGGC
GAATGTCGCG GTGGCGAGAC CTTGGCTATG TTGCAAGCAA TGAATACTGG CCACGAAGGC
TCGATGACCA CCCTACACGC CAATACCCCG CGTGACGCGA TTGCCCGGAT TGAAACTATG
TGTTTGATGT CGGGGATGGA TTTGCCGCTC AAGGCTATCC GTGAACAAGT TGCCTCGGCG
ATTGAGCTGA TTGTGCAACA AGCCCGACTT AAAGATGGTT CGCGGCGGGT TATGGCCATC
TCCGAAGTAA CCGGAATGGA AGGCGATTTG GTGGTGCTCC AAGATATTTT CATCTTTGAG
CAAACTGGCC TCGATGAACG TGGTAAGATT GTAGGGTCGC TCCGGCCAAC CGGGGTTCGG
CCACGCTTCC TTGATCGGTT TGAAGCCTTG AATATTTACC TGCCACCGAA CGTCTTTGGC
AATAGTTCAG AGCGCTTTTA CTAA
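
The record reports a GC content of 51% for this gene. As a rough sketch of how such a figure could be derived from a sequence printed in 10-base blocks like the one above, the hypothetical helper below strips whitespace and reports the percentage of G and C among the A/C/G/T bases. How the database itself rounds or handles ambiguous bases is not stated in the record, so that part is an assumption.

```python
def gc_content(seq_text):
    """Percent G+C among A/C/G/T bases; whitespace and case are ignored."""
    bases = "".join(seq_text.split()).upper()
    counted = [b for b in bases if b in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in counted if b in "GC")
    return 100.0 * gc / len(counted)


# Usage: pass the sequence text exactly as printed, spaces and newlines included.
first_row = "ATGTCGCTCC TAAAGCGTAT TGCAGGAAAT ACGTCCCCAT CCTCGGCTTC CGAACCTACA"
print(round(gc_content(first_row), 1))
```

Passing the full gene sequence instead of the first row gives the whole-gene value to compare against the reported figure.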
 
Protein sequence
MSLLKRIAGN TSPSSASEPT AAAQSSAAAT RPDGAIPRSA SVSAQDRLLD VRHRVQRRLT 
EEVRDVNSTS ETKIRQTVED LLSVVLDSEN IVLSRVERQQ LVESLMSDIV GLGPLDSLLK
DESISEIMVN GPNKIYIEQR GKLTLSGTTF IDDEHAMRVL YRIVSPLGRR IDESSPMVDA
RLQDGSRVNA VIRPISLIGP VITIRKFSKK PLGPEDLIRF GAISREMMEF LSASVRARIN
VVVSGGTGSG KTTLLNVLSS FIPEDERLIT VENAAELQLQ QDHVISLESR TANIEGKGEI
SINDLIINCL RMRPERIIVG ECRGGETLAM LQAMNTGHEG SMTTLHANTP RDAIARIETM
CLMSGMDLPL KAIREQVASA IELIVQQARL KDGSRRVMAI SEVTGMEGDL VVLQDIFIFE
QTGLDERGKI VGSLRPTGVR PRFLDRFEAL NIYLPPNVFG NSSERFY
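
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns sense codons the same amino acids as the standard code and differs only in which codons may initiate translation; since this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq_text):
    """Translate DNA (whitespace ignored) up to, not including, the first stop."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGTCGCTCC TAAAGCGTAT TGCAGGAAAT ACGTCCCCAT CCTCGGCTTC CGAACCTACA"
print(translate(first_row))  # prints: MSLLKRIAGNTSPSSASEPT
```

The output matches the first 20 residues of the protein sequence listed above. Applying the function to the full gene sequence gives the translation to compare against the rest of the record.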