Gene Hhal_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2020 
Symbol 
ID4710385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2223153 
End bp2224313 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content75% 
IMG OID639856493 
Producttype II secretion system protein E 
Protein accessionYP_001003586 
Protein GI121998799 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.453944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG ATCGCGCCAC GGTCCGCCTC GTGGACCGTC TTCTTGCCGA CGCCGTGCGG 
CGTCGGGCCT CGGATATCCA CCTGCAACCG GAGGCCGACC GGGTGCGCGT GCGCCTGCGC
ATCGACGGCC TGCTGCGCGA GGCCGAAGGG CCACCGCCGG GCCTGCGCGG ACGGGTTGCG
GCACGCATCA AACTGCTGGC GGGGATGGAC GTCGCCGAGC AACGCCTGCC CCAGGACGGC
CGGCTGGAGG CCCGGGACGG CGACGGGCAG CGGGTGCAGT TCCGCGTCGC CAGCTGCCCG
GGCGTCCACG GCGAGAAACT GGTCCTGCGC CTGATCGAGC AGGACGCCCC GGCCACCCTC
GACGCCCTGG ACCTCCCCGG CCCGGCCCGG GCGGCACTGG AGTCGGCCCT CGACCGCCCC
GACGGACTGA TCCTGGTCAC CGGCCCCACC GGCTCCGGCA AGACCGCGAC CCTGCACGCC
GCCCTGCGGC GGCTGAACAC CCCCGAGCGC AACATCTGCG CCGTGGAGGA TCCCGTGGAA
ATGGACACCC CCGGGGTCAA CCAGGTGGCC GTGAACCGCC GCGCCGGTAT CGACTTCGCC
CAGGCCCTGC GCGCCTTCCT GCGCCAGGAT CCGGACGTGA TCATGATCGG CGAGATCCGC
GACGCCGAGA CCGCCGCCAT CGCCGTCAAG GCGGCCCAGA CCGGCCACTT GGTCCTCTCA
ACCCTGCATA CACGCAGCGC GCCGGGCGCG GTGGAGCGCC TGGCGCAGAT GGGTCTGCCC
GGCTACGACC TGGCCTCGAG CCTCTCCCTG GTGGTGGCGC AGCGCCTGGT CCGCCGCCTC
TGCCCGGCCT GCCGCGAGAC CAGCAGCGCC GCCGCTCAGC CGGCGGCAGC GGAGCCGGCC
GGCGTCTACC ACCCCCGCGG GTGCCCGGAG TGCCAGGACG GCTATCGCGG TCGGCGGGGG
GTCTTTCAGG TCATGCCGAT GACCGACGCC GTGGCCGATG CCGTCCTGCA TGGCCCTTCG
GCCCGCGAGA TCGAAGCCCG CGCCCGGGCG GCGGGCATGC CGGATCTCCA CGACGCCGGC
TGGCCCCTGG TGGAGACCGG CGAGACCAGC GCCGCCGAAC TACGCCGCGT CACCCGCGAG
GCCGAGCCGT GGCCCGCCTG A
 
Protein sequence
MDDDRATVRL VDRLLADAVR RRASDIHLQP EADRVRVRLR IDGLLREAEG PPPGLRGRVA 
ARIKLLAGMD VAEQRLPQDG RLEARDGDGQ RVQFRVASCP GVHGEKLVLR LIEQDAPATL
DALDLPGPAR AALESALDRP DGLILVTGPT GSGKTATLHA ALRRLNTPER NICAVEDPVE
MDTPGVNQVA VNRRAGIDFA QALRAFLRQD PDVIMIGEIR DAETAAIAVK AAQTGHLVLS
TLHTRSAPGA VERLAQMGLP GYDLASSLSL VVAQRLVRRL CPACRETSSA AAQPAAAEPA
GVYHPRGCPE CQDGYRGRRG VFQVMPMTDA VADAVLHGPS AREIEARARA AGMPDLHDAG
WPLVETGETS AAELRRVTRE AEPWPA