Gene Haur_4076 details

Gene Information

Locus tag: Haur_4076
Symbol:
ID: 5735934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5202352
End bp: 5203932
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 54%
IMG OID: 641281227
Product: phosphoenolpyruvate carboxykinase
Protein accession: YP_001546836
Protein GI: 159900589
COG category: [C] Energy production and conversion
COG ID: [COG1866] Phosphoenolpyruvate carboxykinase (ATP)
TIGRFAM ID: [TIGR00224] phosphoenolpyruvate carboxykinase (ATP)
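
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5202352
end_bp = 5203932
protein_length_aa = 526

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so a CDS of N codons
# encodes N - 1 amino acids.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1581
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 526
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" fields of the record.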


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAC GAGCCGTATT AAGTGAAGGC GACACTGTAG GCGCTAATTT ACATGTCAAG 
CAAGCCTATC GCAACTTAAC GGTGCCGCAA TTGGTGGAAG CTGCTCTCAA ACGGGGCGAA
GCCGTTTTAT CGGCTACTGG GGCTGTCGTT GCCACCACGG GCGCACGCAC GGGTCGTTCT
GCAGATGATA AGTTTGTGGT GGAAACACCA GCAGCCGCGT CAATGCACTG GACGAAATTC
CATAAAGCTA TGAAGCCCGA AACCTATGCC ACGATCAAGG CCAAGGCGTT GGCACACATG
GCCGAACGTG AGATGTTTGT TTTAGATGCT AGCGCTGGGG CTGATCCAGC GTATGCGTTG
CCAATTCGCG TGGTGACCGA GTATGCTTGG CATAACTTGT TCGCTAAGCA ATTGTTCCGC
GATGCGATCA GCAGCGATCA ACAACCGCAA TGGACGGTGC TCAACTTGCC AAGTTTGAAG
CTTGATCCAG CGGTTGATGG CTCGCGCTCA GAAGTTGCCG CCATGATCAA TCTCGATGAA
AAATTGATTT TGATTGTCGG TACTGAATAC GCTGGCGAGA TCAAGAAATC GATCTTTACG
GTATTGAACA TGGTGCTGCC AAGCCAAGGC GTGATGCCAA TGCACTGTTC AGCCAACATT
GGCAGCAAGG GCGATGTAGC CTTGTTCTTC GGGCTTTCGG GCACGGGCAA AACCACGCTC
TCAGCCGACC CCGAACGGAT TTTGATTGGC GATGATGAGC ATGGTTGGAG CGCCAACGGC
GTGTTCAACT TTGAAGGCGG CTGCTATGCC AAGTGTATTC GCTTGCGCCG CGAATCGGAG
CCAGAAATTT TCGACGCAAT TCGCTATGGG GCGGTGCTCG AAAACGTGGT GCTCAGCGAT
AGCCGCGATC CCAATTATGA TGATGCGTCG TTGACCGAAA ACACCCGCGC TGCCTATCCC
TTGGAATACA TTCCCAACGT CAGCGAAACG GGTATGGGCG GCCAACCAGA AACGATCATC
TTCTTGACCG CTGATGCCTT TGGAGTTTTG CCGCCAATCG CCAAACTCAG CCCTGAACAA
GCGATGTATC ACTTCTTGTC GGGCTATACC GCCAAGCTGG CTGGCACCGA AACGGGCGTT
GGCTCAGAGC CACAAGCAAC GTTTAGCACC TGCTTTGGCG CACCGTTTAT GCCTTTGCAC
CCAACTGTGT ATGCCGATTT GCTTGGCCAA AAAATGCGCG AACACAAAGT CCGTGTATTT
TTGGTCAACA CTGGCTGGAC TGGTGGTTCG TTCGGGGTTG GCAAGCGCAT GAGTTTGCGC
GATACCCGCA CGATGGTGCA TGCCGCCTTG GCTGGCAAAC TCGATGCTGT GGAAATGTGG
CACGATGAGC GTTTCAATCT CGATGTGCCT GTGGCAATCG AAGGCGTTGA TAACAGTGTG
CTGCAACCCC GCCAAACTTG GGCCGATGCC AGCGAATACG ATCGGGTTGC CGATGACTTG
GCCGCCCGCT TCCGCAAGAA CTTCGAGCAA TACGCCGAAC GCGCTGGCGA AACCGTGGTA
AACGCCGGCC CACAAGCGTA G
 
Protein sequence
MTERAVLSEG DTVGANLHVK QAYRNLTVPQ LVEAALKRGE AVLSATGAVV ATTGARTGRS 
ADDKFVVETP AAASMHWTKF HKAMKPETYA TIKAKALAHM AEREMFVLDA SAGADPAYAL
PIRVVTEYAW HNLFAKQLFR DAISSDQQPQ WTVLNLPSLK LDPAVDGSRS EVAAMINLDE
KLILIVGTEY AGEIKKSIFT VLNMVLPSQG VMPMHCSANI GSKGDVALFF GLSGTGKTTL
SADPERILIG DDEHGWSANG VFNFEGGCYA KCIRLRRESE PEIFDAIRYG AVLENVVLSD
SRDPNYDDAS LTENTRAAYP LEYIPNVSET GMGGQPETII FLTADAFGVL PPIAKLSPEQ
AMYHFLSGYT AKLAGTETGV GSEPQATFST CFGAPFMPLH PTVYADLLGQ KMREHKVRVF
LVNTGWTGGS FGVGKRMSLR DTRTMVHAAL AGKLDAVEMW HDERFNLDVP VAIEGVDNSV
LQPRQTWADA SEYDRVADDL AARFRKNFEQ YAERAGETVV NAGPQA
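
The sequences above are printed in space-separated groups of ten characters. The sketch below shows one way to strip that grouping and translate a stretch of the gene sequence so it can be compared with the protein sequence. It is an illustration only: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. Only the first and last lines of the gene sequence are used, both of which begin on a codon boundary because every full line holds 60 bases.

```python
from itertools import product

# Codon-to-amino-acid assignments of the standard genetic code, listed in
# NCBI's T, C, A, G base order. Assumption: translation table 11 shares
# these assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCGAAC GAGCCGTATT AAGTGAAGGC GACACTGTAG GCGCTAATTT ACATGTCAAG"
last_line = "AACGCCGGCC CACAAGCGTA G"

print(translate(first_line))  # MTERAVLSEGDTVGANLHVK
print(translate(last_line))   # NAGPQA
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare with the "GC content" field.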