Gene Hhal_2355 details

Gene Information

Locus tag: Hhal_2355
Symbol:
ID: 4709078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2581473
End bp: 2582933
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 71%
IMG OID: 639856830
Product: hypothetical protein
Protein accession: YP_001003920
Protein GI: 121999133
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
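
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon yields one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2581473, 2582933)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```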


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.314227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAAGC GTATCGAGGC CGCTGAAGGT CTGGAGTTGG AACGGCTCGA CAGTTGCCGC 
TGGCGGTTGC CGCGGCAGGG GCGGATGCAG GTAGACGGGT TGATCTTCGC CAACGACGCG
TTGATTGAGG ACATCCGCGA TACCGAGGCC GTGCGCCAGG TGGCCAATGT GGCCTGCCTC
CCCGGGGTGG TCGGGCGGTC CATCGGCATG CCGGATATCC ATTGGGGGTT CGGCTTCCCC
ATCGGAGGCG TGGCCGCCTT CGATCCGGAC CAGGGGGGCG TGATCTCGCC CGGAGGGGTG
GGCTACGACA TCAACTGCGG AGTCCGGCTG CTGCGGACGC CGCTACAGGC CGAGGACCTG
GGCGCCCACC TGCCGCGCCT GATGGATCGA CTCTTCGAAC GCATCCCGGC CGGCATGGGC
CGTGGGTACG GCGACACCCT GCTGCGCAAC CGGGATATGC GCCGGTTGCT GCGCGAGGGG
GCGGCGTGGG CCGTGGAGGT GGGGCTGGGC GAGCCCGAGG ATCTGGCCCG GATCGAGGAC
CGTGGGTGCC TGCCCGGCGC CGACCCCGAG GCGGTCAGCG ATCGGGCCAT CCAGCGCGGA
CGGGATCAGG TCGGTACGGT GGGATCCGGC AACCACTTCA TCGAGATCGG CTGTGTGGAC
GATGTCTACG ACGAAGCCGC TGCCCGCCGC CTGGGGCTCG AGGCGGGGAC GCTGACCGTG
ATGATCCACT CCGGGTCACG CGGGCTCGGT CACCAGGTCT GCGATGACTT TCTGGTGACC
ATGGAGCGGA TCACCGGGCG CAACGGCATC GAGCTGCCCG ACCGTCAGCT GGCCTGCGCG
CCGCTGAGCT GCTCCGCCGC CCGGGACTAC CTGGGGGCCA TGCAGGCCGC CGCCAACTTC
GCCTACGTCA ACCGCCAGGC GATGACCCAG CAGGTGCGCC GGGTCTTCGC CGAGGTGCTG
GGGGAGGAGG CGCACCTGGA GCTGGTCTAC GACGTCTCCC ACAACATCGC CAAGTTCGAG
CGCCATCGGG TCGACGGTGA GGAGCGCGAG GTCTGCGTCC ACCGCAAGGG CGCCACCCGC
GCCTTCCCGC CCGGCCACCC GGAACTCCCC GAGGATCTGC GCGGGCTCGG GCAGCCGGTG
CTGCTGCCCG GCGACATGAC CCGCTACTCC TACGTCCTGC TCGGCACCCA GGGCGCCTAC
GCCGAGACCT TCGGCTCCTG CGCCCACGGC GCCGGACGCC GTCTCAGCCG GCGCCAGGCC
AAACGCGCCG CTGAGGGGCG GGACTTGGAT GCCGAGCTGG CCGAGGCTGG TATCGAGGTG
CGCGCCTCGT CCCGGCAGAC GGTGGCCGAG GAGCTGGCCG AGGCGTACAA GGACGTGTCC
GATGTGGTGG ACGTGGTGGC CCACGCCGGC ATTGGCCGCC GGGTGGCCCG CCTGCGTCCG
CTGGGGGTGC TCAAGGGGTG A
 
Protein sequence
MVKRIEAAEG LELERLDSCR WRLPRQGRMQ VDGLIFANDA LIEDIRDTEA VRQVANVACL 
PGVVGRSIGM PDIHWGFGFP IGGVAAFDPD QGGVISPGGV GYDINCGVRL LRTPLQAEDL
GAHLPRLMDR LFERIPAGMG RGYGDTLLRN RDMRRLLREG AAWAVEVGLG EPEDLARIED
RGCLPGADPE AVSDRAIQRG RDQVGTVGSG NHFIEIGCVD DVYDEAAARR LGLEAGTLTV
MIHSGSRGLG HQVCDDFLVT MERITGRNGI ELPDRQLACA PLSCSAARDY LGAMQAAANF
AYVNRQAMTQ QVRRVFAEVL GEEAHLELVY DVSHNIAKFE RHRVDGEERE VCVHRKGATR
AFPPGHPELP EDLRGLGQPV LLPGDMTRYS YVLLGTQGAY AETFGSCAHG AGRRLSRRQA
KRAAEGRDLD AELAEAGIEV RASSRQTVAE ELAEAYKDVS DVVDVVAHAG IGRRVARLRP
LGVLKG
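
Both sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard TCAG ordering; translation table 11 uses the same codon assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in the record.
first_row = ("ATGGTCAAGC GTATCGAGGC CGCTGAAGGT "
             "CTGGAGTTGG AACGGCTCGA CAGTTGCCGC")
print(translate(clean(first_row)))  # MVKRIEAAEGLELERLDSCR
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the reported GC content.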