Gene Hhal_0155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0155 
Symbol 
ID4710750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp180465 
End bp182525 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content73% 
IMG OID639854613 
ProductRhs element Vgr protein 
Protein accessionYP_001001751 
Protein GI121996964 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.91616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGGA GTGCCAAGGA AGGCGGTGGC GCGCACGCGC TGTTTCGGGT GCGTCTACCG 
GGATTGGACC CGGACGTGCT GCGGGTCGAG CAGATTGTCG GTCATCAGGC ACTGGATGAC
GGTGACCGTC TGCAGGTCCG GGTGTGTGGC CCGGTCGGGG ATCGGGTCCG GGAGTGGCCC
GGGCAGCTGG TCGCGTTGAC CCTGGGTTGG GGGGTGGCGC CGCGAACACT GCACGGGGTG
ATCAGCGAGG TGGCCCTCTG CGGGGGGGAG ATGTCCTCGC GGGCGGCGAC CCTGACCGTG
TGCTCGCTGC TCCATCCCCT GGCCGGATCA TACCGGCGGC GGGTCTATCG GCGTTGTTCG
GCCTTGCACA TGGCCCGCGA AGTCCTCCAG GGCGCGCTGC CCGACGGGGT GTCGGTCACC
GTGGGGGTGG AGCGGTCCCT GCCGGAGCGG CCACTGGTCG TCCAGGGCGC TGCGGATGAC
CTGGCCTTCC TGCGCCGGGT GTTGGCCCGG GAGGGGTGTT TCCCGGTGGT GCGGGATGCC
GGTGGGCGGC CAGAGGTCCG CATCGTCGAT CATCTGGCGC AGGCCGAACT CGACGCGGCG
GCGCTGACCT GGCGCCCGGG CGGTGGGCCG ACCCCGACCA CCCGGGCGAC CGTCTCCGAG
GTCTCCCGCC GCTGGACGCT GCAGCCGGGG CAGGTGCGGG TCGGCGGTTT CGATCCGGCG
TTGCCCGACC GCGGGCGTCC CGCCACGGCA GGCACCGACG CCGCGGAGGA CAGTCCGATG
GAGTTGGGGC TCCACGGGGT CACGGCGGGT GGTGAGTCGG CCCATGCAGA GTGGGCCGAG
GCCGTGCACG AGGCCTCCGC GGCCCAGCGC TGCCGTATCG AGGCGGTCGT GGCGACGCCG
TTGCTGCCCG GGATGCGGGT GGTCATCTCC GGGCATCCGG AGGCTTCGCT CAATGGCGCT
TACTGGGTCT ACCGCGCCGA GCACGAGGGG GACCAGGCCG CGGCGATTCA CGGCGGTGGT
GGTGCGGATC GGGTGGACTA CCGGGGGCGG GTGGAGCTCC TGCCGCTCGA CCCCGGTTAC
CGGCCGGCCC CGGTGCCTGC ACCCGCGATC CCGGGGGTCG CCGTGGCATG GGTCGCTGGC
GGGGATCCGG AGCGCGCCGA GGTCGACGAG GCCGGGGCGT ACCGGATTCG CCTGTTCGAC
GAGGCCGAGG CGGCCGATGG GGTCGAGGCA GCCCCACCGG GGCCGCCGGT CTGGGCCGTC
CAGCCGAGCG CCGGTGCGCA GCACGGCCTG CACTTGCCCC TGCTGCCCGG GACGCGCGTG
GCGGTGGCGG GGTTGCACGG CGACCTGGAA CAGCCAGTGA TCCTGGGGGC CCTGAGCAGT
CAGGATCAGC CCGGTCCCGT GACGGACCGC AATCCCCACC AGCACCTGCT GCGGACCGCG
GCCGGGCAGC GCCTGCTGCT GGACGACCGG CCCGGGGCCG AGGGGGCGGA GCTGGCCGTG
GGCGAGGCCG CGCGACTGAG TCTGGAGGGG CACGAGGAGG CCCCCGGGGC AACGCTGGAG
GCCCCCAACG GTTATCTCGA GCTGGCCAGC GGCGGAGAAC AGCGCGTGCG CAGCGGTGGC
GATCAGCGGC TCGATGTGGC CGGGGCCTAT CGGGTGGAGG TCGAAGGGAG CTACCACCTG
GAGACGGAGG ACGGCGCGTT GCACTGGTTT GCCGGCGACA CCCTGCATCT GGAGACCGGC
GCGGGGGATC TGCTGCACGA GGCGCCAGAC GGGGAAGTGG CCCTGAAGGC GGGGCGTCAG
GTGTCGCTGG ATGCGGGATC AGGATTGCGG TTGATCGCCC GACAGGGTGC GGGCGACTGG
CAGGTTGAAT CGGGCGCGCT GCGCCTGGAG GCGGCCGGCG ACGTCGCCCT GGTCAGCACC
GGGGGCGGCT GTCTGCAGCT CGGCGACGGC TTCGGGCTGC GCATCGAGGA CAGTGGTGCG
GTCTATGTCG AGGGGCGGCA CATTGAGCTC AGCGCCGAGC AATTGGTGAT CGCTGCCGAC
CGCATCGAGG AGGGGCGGTG A
 
Protein sequence
MKGSAKEGGG AHALFRVRLP GLDPDVLRVE QIVGHQALDD GDRLQVRVCG PVGDRVREWP 
GQLVALTLGW GVAPRTLHGV ISEVALCGGE MSSRAATLTV CSLLHPLAGS YRRRVYRRCS
ALHMAREVLQ GALPDGVSVT VGVERSLPER PLVVQGAADD LAFLRRVLAR EGCFPVVRDA
GGRPEVRIVD HLAQAELDAA ALTWRPGGGP TPTTRATVSE VSRRWTLQPG QVRVGGFDPA
LPDRGRPATA GTDAAEDSPM ELGLHGVTAG GESAHAEWAE AVHEASAAQR CRIEAVVATP
LLPGMRVVIS GHPEASLNGA YWVYRAEHEG DQAAAIHGGG GADRVDYRGR VELLPLDPGY
RPAPVPAPAI PGVAVAWVAG GDPERAEVDE AGAYRIRLFD EAEAADGVEA APPGPPVWAV
QPSAGAQHGL HLPLLPGTRV AVAGLHGDLE QPVILGALSS QDQPGPVTDR NPHQHLLRTA
AGQRLLLDDR PGAEGAELAV GEAARLSLEG HEEAPGATLE APNGYLELAS GGEQRVRSGG
DQRLDVAGAY RVEVEGSYHL ETEDGALHWF AGDTLHLETG AGDLLHEAPD GEVALKAGRQ
VSLDAGSGLR LIARQGAGDW QVESGALRLE AAGDVALVST GGGCLQLGDG FGLRIEDSGA
VYVEGRHIEL SAEQLVIAAD RIEEGR