Gene Hhal_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2043 
Symbol 
ID4710017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2246108 
End bp2247316 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content73% 
IMG OID639856516 
Productprotein of unknown function Met10 
Protein accessionYP_001003609 
Protein GI121998822 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.338836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAC GACCCGCACT CCGCCTGAAA CCCGGCGAGG AGCGCCGCCT GCGCGCCGGC 
CACCTGTGGA TCTTCAGCAA CGAGGTGGAT ACCGCCCACA CCCCGCTGCG GGGGTTCGCG
CCCGGCGAGC AGGCCGTGGT GGAGGACGCC CGCGGCAAGG CCCTGGGGTG CGCCTACGTC
AACCCCAACT CGCTGATCTG CGCCCGGCTG GTCAGTCGCG ACGCCAAGGT GGCCCTGGAT
CGCTCGACCC TGGTCCACCG CCTGCAGGTG GCCCTGGCCG CACGCCAGCG CCTCTTCGCC
GAGCCCTGGT ACCGGCTGGT CCACGGCGAG GCCGATGGCC TGCCCGGCCT GGTCATCGAC
CGTTTTGGGG ACTGCTGTGT GGTGCAGCCG AACACCGCCG GGATCGAGCG GTTGCAGGCC
GAGGTGCTGG AGGCCCTGGA GAAGGTCGTC GCCCCGGCGT ACGTCCTCTG GCGGGCCGAC
AACGCCGTGC GCGAGCGCGA GGGGCTGGAC CTGCGCGTGG AGTGGCTCGG CCAGCCGGGG
CCGGAGGAGC TGGAGGTCCG CGAAGGGGGG CTGCACTTCC GGGTCCCGGT GGTCAGCGGG
CAGAAGACGG GTTGGTTTTA CGATCAGCAG GCCAATCGCC GGCGCCTGGC CGCCTACGCC
GGCGACGCCC GGGTGCTGGA CGCCTTCTCC TACGCCGGCG GCTTCGCCAT CGCCGCCGCG
GTGGCCGGCG CCCGCGAGGC GGTGGCCGTG GAGCGCTCCG CCGAGGCGTG TGACCGCATC
GCCGCCAATG CCGAGCGCAA CGGCGTCGGC GATCGGGTGA CGGTGATCGA AGGCGAGGTC
AACGACTACC TGGCGGCGGC CCGTCAGGAG GGCGAGCGCT ACGACGTGGC GGTGGTGGAT
CCGCCGGCGT TCATCAAGCG CCGCCGCGAC CGCAAGGCCG GTGAGCGCGG CTACCGCACG
GTCAACGAGG CGGCCCTGCG CCTGCTCGGC CGCGACGGCG TGCTGCTCAG CTGCTCCTGC
TCGGCCCACC TGCCCGAGGA GCGCCTCTCC GGCATCCTGC TGGCGGCCGG GCGGCACCTG
GACCGCTCCG TGCGCATCCT CGAGCGCGGC GGTCTGCCGC CGGACCACCC GATCCACCCG
GCGATCCCCG AGACCGACTA CCTCAAGGCG CTCTTCATCC GTGCGGTGAT CGCCACCAGC
CTGCCGTGA
 
Protein sequence
MQQRPALRLK PGEERRLRAG HLWIFSNEVD TAHTPLRGFA PGEQAVVEDA RGKALGCAYV 
NPNSLICARL VSRDAKVALD RSTLVHRLQV ALAARQRLFA EPWYRLVHGE ADGLPGLVID
RFGDCCVVQP NTAGIERLQA EVLEALEKVV APAYVLWRAD NAVREREGLD LRVEWLGQPG
PEELEVREGG LHFRVPVVSG QKTGWFYDQQ ANRRRLAAYA GDARVLDAFS YAGGFAIAAA
VAGAREAVAV ERSAEACDRI AANAERNGVG DRVTVIEGEV NDYLAAARQE GERYDVAVVD
PPAFIKRRRD RKAGERGYRT VNEAALRLLG RDGVLLSCSC SAHLPEERLS GILLAAGRHL
DRSVRILERG GLPPDHPIHP AIPETDYLKA LFIRAVIATS LP