Gene Haur_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2507 
Symbol 
ID5734388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3202591 
End bp3204132 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID641279647 
Producttype II secretion system protein E 
Protein accessionYP_001545273 
Protein GI159899026 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000823712 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTTC TCAAACGTCT TGGAAGCACG CCAACAAGCC CAGAGCCAAC TCCTCCTGCT 
CAGCCAGTTC AACCGATGGC TGGCGAACAA TCTATGCCTG AATATCGCTC ATCATCGTTG
ACTCCACTAA GCCCACCACC ACCAGCACCA TCATCGCCAC CCATGCCTGG CGGTAACCTC
AGTGCTTTGG CTGGCTCTAG CTCAGGGTTA AGTGGCGGTA CGATCAACAC CAGCGGCTTT
TTGGTCTTGG CTGATCGCGA TGCCCATCGC AAAATGCTGG AGCTTTCGCT CTGGATTGTC
GATAAAATTC AGGCTTCGGT TGGTTCGCAA ACCCAACTAC AACGCAACGA AGATTCGGAG
CGCCTCATCC AAGAGCGCTT TACCACCTTT GTGCGTCAAT CCACTACCAA TCTCGACCAA
GATGGCACCA AATTGCTCTA TCAAATGGTG CTCGATGAAT TATTTGGCTT CGGGCCACTT
GAGCAATTGA TTCGCGACGA TACCATCACT GAAATTATGG TCAACAGCGC CACGGTGGTG
TATGTTGAAC AAAAAGGTAA ATTAACCCTT TCGCCAGTCA TTTTTGCCTC GGAAGATCAT
GTGCTCAAAG TGATCGACCG AATTATTCGG CCACTCGGGC GACGGGTTGA CCGCAAATGG
CCCATGGTCG ATGCGCGTTT GCCCGATGGT TCGCGGGTCA ATGCCATCAT CCCGCCTTGT
GCAATCGATG GTTCGTCGCT GAGTATTCGT AAATTCTCCA AGAAAAAACT CCAAGTTAGC
GATTTGATCA ATTATGGCTC GATGACCAAG GAAATGGCCG ACTTCCTGAA TGCTTGTGTG
GTCAGCGCCA TGAACTTAAT CGTCTCGGGT GGTACTGGTT CGGGTAAAAC CACACTCCTA
AACGTGCTCT CCAACTTCAT CCCCGACCAC TATCGGATTT GTACAATTGA AGACTCAGCC
GAACTACAAC TTGGTAAAGA TCACGTGATT CGGCTTGAAT CCAAGCCTGC CGATGTTGAT
GGCTCGGGCT TGGTCACCAT CCGCGACCTT GTGAAGAACT CGCTGCGGAT GCGCCCTGAC
CGCATTGTGG TAGGGGAAAT TCGTGACGGC GCGGCGCTCG ACTTGCTCCA AGCCATGAAC
ACTGGCCACG ACGGCTCAAT GTCCACCGTC CACGCCAACA CCCCACGCGA CGCAATTCGC
CGTTTGGAAA CCTTGGCACT GATGTCGGGA CTCGATTTAC CAGTTGCGGT TATTCGCGAA
CAAATTGCTT CGGCAGTACA CGTGATTGTG CAGCAAGCCC GTTTGCGCGA CGGCTCACGG
AAAGTCGTAG CGGTAACTGA AGTTCAAGGC ATGGAAAGTG GTCAGGTCGT GCTACAAGAT
ATCTTCATCT TTGAAGATCA AGGTACTGCG CCAGATGGCA AAGTGATGGG TGTGCTGCGG
CCAACCGGGA CGCGTCCACG CTTTACGCCA GTGCTCGAAG CCAAAGGCTT CAAATTGCCT
CCATCAATCT TCGGCGCAAC CTTCCCCGGC ATGCGGCGCT AA
 
Protein sequence
MSLLKRLGST PTSPEPTPPA QPVQPMAGEQ SMPEYRSSSL TPLSPPPPAP SSPPMPGGNL 
SALAGSSSGL SGGTINTSGF LVLADRDAHR KMLELSLWIV DKIQASVGSQ TQLQRNEDSE
RLIQERFTTF VRQSTTNLDQ DGTKLLYQMV LDELFGFGPL EQLIRDDTIT EIMVNSATVV
YVEQKGKLTL SPVIFASEDH VLKVIDRIIR PLGRRVDRKW PMVDARLPDG SRVNAIIPPC
AIDGSSLSIR KFSKKKLQVS DLINYGSMTK EMADFLNACV VSAMNLIVSG GTGSGKTTLL
NVLSNFIPDH YRICTIEDSA ELQLGKDHVI RLESKPADVD GSGLVTIRDL VKNSLRMRPD
RIVVGEIRDG AALDLLQAMN TGHDGSMSTV HANTPRDAIR RLETLALMSG LDLPVAVIRE
QIASAVHVIV QQARLRDGSR KVVAVTEVQG MESGQVVLQD IFIFEDQGTA PDGKVMGVLR
PTGTRPRFTP VLEAKGFKLP PSIFGATFPG MRR