Gene Hhal_1402 details

Gene Information

Locus tag: Hhal_1402
Symbol:
ID: 4711146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1514973
End bp: 1516241
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 65%
IMG OID: 639855869
Product: isocitrate lyase
Protein accession: YP_001002971
Protein GI: 121998184
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
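
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive (a common convention, though the record does not state it) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1514973
end_bp = 1516241

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1269 422"
```

Under these assumptions the result agrees with the listed gene length (1269 bp) and protein length (422 aa).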


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAAAG ATCAAGCGGC GATTGAGCGT GACTGGGCGG AGAACCCGCG CTGGAAAGGG 
GTGCAGCGGG GCTACGGGGC CGATGAGGTC GTCCGTCTCC GGGGGACCGT GCACGTCGAG
TACAGCCTGG CGCGCCAGGG TGCAGAGAAG CTGTGGCAGT CCATGCACGA GATGCCCTAC
GTCAACGCCC TCGGCGCCCT GACCGGCAAC CAGGCCCTGC AGCAGGTCAA GGCCGGGCTC
AACGCCATCT ACCTCTCGGG CTGGCAGGTG GCCGCCGACG CCAACCTCGG TCAGACCATG
TATCCCGACC AGTCGCTCTA CCCGGCGAAC TCAGTCCCTG CGGTGGTCGA TCGCATCAAC
AACGCGCTGC TGCGCGCCGA CGAGATCAAC CACGCCGAGG GCAACCCGCC GTTCGACTTC
ATGAAGCCCA TCGTGGCGGA CGCCGAGGCC GGTTTCGGCG GCGTGCTGAA CGCCTTTGAG
CTGATGAAGG GGATGATCCG CGCCGGTGCC GCGGGGGTCC ACTTCGAGGA TCAGCTGGCT
TCGGTGAAGA AGTGCGGCCA CATGGGCGGC AAGGTGCTGC TGCCCACCCA GGAGGCGGTG
CAGAAGCTCA TCGCTGCGCG CCTGGCGGCC GACACCATGG ATGTGCCGAC CATCCTGGTC
GCCCGTACGG ACGCCGAGGC GGCAGACCTG CTGACCTCCG ACGTGGACGA CAACGACAAG
CCGTTCATTA CCGGCGAGCG CACTGCGGAG GGCTTTTTCC GCACCAAGCC GGGCATCGAG
CAGGCCATCA GCCGCGGCCT CGCCTACGCC CCTTACGCCG ACGTGATCTG GTGCGAGACC
GGCAAGCCGG ATCTCGAATT CGCCCGCGAA TTCGCGCAGG CCATTCACGA GAAGTATCCC
GGCAAGCTGC TCGCCTACAA CTGCTCGCCG TCGTTCAATT GGGCGGGCAA CCTGGACGAG
GCCACCATCC GCAAGTTCCA GGATGAGCTC GGTAAGATGG GCTTCAAGTT CCAGTTCATC
ACGCTGGCCG GCTTCCACTC GCTCAACTAC TCGATGTTCG AGCTGGCCCG CGGCTACAAG
GAGCGGCAGA TGGAGGCGTA CTCCGAGCTG CAGCAGGCGG AGTTTGCCGC GGAGAAACAC
GGTTACACCG CGACCCGTCA CCAGCGGGAG GTGGGCGCCG GCTACTTCGA CCAGGTCACC
AACGTGATCC AGGGCGGCCA GTCCTCGGTG ACAGCGCTGA AGGGGTCGAC GGAGGAAGAG
CAGTTCTAA
 
Protein sequence
MLKDQAAIER DWAENPRWKG VQRGYGADEV VRLRGTVHVE YSLARQGAEK LWQSMHEMPY 
VNALGALTGN QALQQVKAGL NAIYLSGWQV AADANLGQTM YPDQSLYPAN SVPAVVDRIN
NALLRADEIN HAEGNPPFDF MKPIVADAEA GFGGVLNAFE LMKGMIRAGA AGVHFEDQLA
SVKKCGHMGG KVLLPTQEAV QKLIAARLAA DTMDVPTILV ARTDAEAADL LTSDVDDNDK
PFITGERTAE GFFRTKPGIE QAISRGLAYA PYADVIWCET GKPDLEFARE FAQAIHEKYP
GKLLAYNCSP SFNWAGNLDE ATIRKFQDEL GKMGFKFQFI TLAGFHSLNY SMFELARGYK
ERQMEAYSEL QQAEFAAEKH GYTATRHQRE VGAGYFDQVT NVIQGGQSSV TALKGSTEEE
QF
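
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one minimal way to reproduce that, not the tool the database used. It relies on translation table 11 sharing its amino acid assignments with the standard code (the two differ only in permitted start codons), and it translates just the first 60 bases of the gene sequence for brevity. The helper names `build_codon_table` and `translate` are hypothetical.

```python
from itertools import product

def build_codon_table():
    """Map each codon to a one-letter amino acid ('*' marks a stop codon).

    The amino acid string is in the conventional T, C, A, G codon order.
    """
    bases = "TCAG"
    amino_acids = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
    codons = ("".join(triplet) for triplet in product(bases, repeat=3))
    return dict(zip(codons, amino_acids))

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    table = build_codon_table()
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = table[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First line of the gene sequence as printed in this record.
first_line = "ATGTTGAAAG ATCAAGCGGC GATTGAGCGT GACTGGGCGG AGAACCCGCG CTGGAAAGGG"
print(translate(first_line))  # prints "MLKDQAAIERDWAENPRWKG"
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` should give the full protein in the same way.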