Gene Hhal_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1199 
Symbol 
ID4710363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1304016 
End bp1305272 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content70% 
IMG OID639855672 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001002776 
Protein GI121997989 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0292186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAC CCCCTGGCTT CGCCCGCTAC GCCGATCGCC TGTGGGCTGA AGCACTGCCC 
CTGGACGAGC TCGCCGAACG TTTCGGCACC CCGTGCTACG TCTACTCCCG CGCCGCCATC
GAGGAGCGCT GGGCACTCTA CCGGAGCGCC CTCGGCGTCG CCGGGGACGT CTGCTACGCC
GTCAAGGCCA ACGGCAACCT CGCCCTGCTG CAACTGCTCG CCCGCCACGG AGCCGGATTC
GACATCGTCT CCGGCGGCGA GCTCGAACGC GTGCTGCACG CCGGCGGGGA CGCCGCGCGG
GTGGTCTTCT CGGGTGTTGG CAAAGGCACC GACGAGATCC GTCGCGCCCT GCAAGCCGGT
ATCCGCTGCT TCAACGTGGA GTCCGCTGCC GAACTCGAAC GCATCGCCAG CGTGGCCGCC
ACCGAGAACA CCCCGGCGCC GGTGGCCCTG CGCGTCAACC CCGACGTCAA CCCCGAGACC
CACCCCTACA TCGCCACCGG CCTGGCCCAG AGCAAATTCG GCATCGCCCT GGAGGAGGCC
GAGGCCCTCT ACCTCCAAGC GGCTAACGAT CCGCGCCTCG AGGTCCGTGG CATCGCCTGC
CACATCGGCT CGCAACTGCT CTCGGTGGCA CCGCTGACCG AGGCGGCAGA ACGGCTGGCG
GCGCTGGCCC GGCGACTGCA GGAGCAGGGC ATCTCCCTGG ACCACATCGA CGCCGGCGGT
GGCCTCGGCG TGCACTACAT CGATGAGCAG CCGCCAACCC CGGCCGAGCA CATCGAGGCG
ATCTCGGCGC CGCTGCGCGA CCTTGGCGTA TCCGTCCTGG TCGAGCCCGG GCGCGCCATT
GTTGCCGAGG CCGGCATCCT GCTCACCCGC ATCGAGTATC TCAAGCACAA CGGCGGCAAG
GAGTTCGCCA TCGTCGACGC CGGCATGAAC GACTACCTGC GCCCAGCGCT CTACGACGCC
GCCCACACCC TGGAGGCGGT CACCCCGTCG GAAGCCGAAC TTCGACCGGT GGACGTGGTC
GGACCGGTCT GCGAATCCGC CGACACCTTC GCCCGCGACT GCACGCTGCC GGCAGCCGCC
GGCGGCTTGC TGGCCATCCG TAGCGCCGGC GCCTACGGTG CCGTCATGGC CTCGCAGTAC
AACGCCCGGC CCCGACCGCC CGAGGTGCTG GTGGACGGCA CCCAGGCGCA CCTGATCCGC
CGGCGCGAGA CCATCGACGA GCTGATGAGC GGGGAATCGC TCCTGCCGGA GGGCTGA
 
Protein sequence
MSTPPGFARY ADRLWAEALP LDELAERFGT PCYVYSRAAI EERWALYRSA LGVAGDVCYA 
VKANGNLALL QLLARHGAGF DIVSGGELER VLHAGGDAAR VVFSGVGKGT DEIRRALQAG
IRCFNVESAA ELERIASVAA TENTPAPVAL RVNPDVNPET HPYIATGLAQ SKFGIALEEA
EALYLQAAND PRLEVRGIAC HIGSQLLSVA PLTEAAERLA ALARRLQEQG ISLDHIDAGG
GLGVHYIDEQ PPTPAEHIEA ISAPLRDLGV SVLVEPGRAI VAEAGILLTR IEYLKHNGGK
EFAIVDAGMN DYLRPALYDA AHTLEAVTPS EAELRPVDVV GPVCESADTF ARDCTLPAAA
GGLLAIRSAG AYGAVMASQY NARPRPPEVL VDGTQAHLIR RRETIDELMS GESLLPEG