Gene Haur_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4237 
Symbol 
ID5736091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5400885 
End bp5403674 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content53% 
IMG OID641281392 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_001546997 
Protein GI159900750 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.582668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATACAC CTGAAACCGA CCGCACGCCG CTTTCAATGC TGATCCATTC CCTCGGGAAT 
GTGCTTGGCG ATGTGATTGT GGCCCAAGAT GGGGTGTCAG CGTTTGAGCT TGAAGAAGAT
GTGCGCCAAC GCACCAAGCA ACGCCGAAGC GATGGAACAT TGCAAGAGAC TCAAACCCTG
ACTGAGTTGA TTAGTCAATT GCCAGTTGCC CAACTCATGG GGCTGATTAA GGCCTTTACT
CATTATTTTG GCTTGGTCAA TTTGGCCGAA AGCGTTGAGC GCCTACGAGT GCTGGCCGAA
CGCGACCGCC AAAATGGCGA TGCACCACGC TCTGAATCGG TCGAATTGGC ATTGCAAGAG
TTGCGTGATC GTGGCATTAC CGCCCAGCAA GTGCAAGATT TGCTTGATCA TGCCGAGATT
CGGCCAGTTT TTACTGCCCA CCCGACCGAG GCCAAACGCC GAACGACGCT CAAAAAGCAC
CATCGCATTG CAGGGGCGGC GCGGCAATTA ACCGCCGATA CGACCTTTCA ACGCCAGCGC
GAACGCTTGC TCGAATCAAT TCGTGAGGAA GTGATCTCGC TCTGGCAAAG CGATGAGGTG
CGGATTATCA AGCCAACCGT GATCGACGAA GTGAAGAATA ATCTCTATTA CTTCGAAGAA
TCGCTGTTCG ATATGATTCC GCAACTCTAC CGTGATACCG AGGCCTCGTT GCGCCAAATT
TACCCTGAGC ACGAATGGCG TGTGCCAGCC TTCCTGCGCT TTGGCTCATG GGTTGGCGGC
GATCGCGATG GCAATCCCTT TGTAATTCCC TCGGTCACGG TTGAAACGCT TAAACTGTTG
ATGGGCCGTT CGTTGCGTGA GCATATTCAT TCAGTTGAGC GCTTGAGTCA TCGTTTAAGC
CAATCGTCGC GCCAAGTACC AATTAGCGAA GAATTAGCCC AATCGCTAAT CCATGATGCG
CCGTTGTTCC CCGAATTGGC CCAAGTGCTG GAGCGGCGCA ATCCGCATGA GCCATATCGC
CAAAAATGCT CCTACATTCA CGCCAAATTG CATGCCACCT TGGCCTATGT TGAGCGCTAC
GAGCCAGATT GGGCACGCGG CGGCCATCGC CCAGCTGAAG GCACCTGGTA TGCCAATGCC
AACCAATATC TCGCCGATTT AGCAACCATG GAATATAGCT TGCGCACCAA TGGCGCGGCC
TCAGTCGCCG ATGGCTTTTT GCGCGATATC CAATGCTCGG CCAAAGTCTT TGGTTTGCAC
ACCGCCACCC TCGATATTCG CCAACACAGC AGCCGCCACA CCAACGCCCT GAGCGAAATT
TTTGAATATG CAGGCATCTG CGACGATTAC GCCAGCCTGA GCCAAGCCGA ACGCACGGCT
GTATTGGAAC GCGAGCTAGC CAATAATCGT CCGTTGATTC CAACTCATCT CTACTACAGC
CCCGAAACCG TTGAGATTAT CGAAACCTTC CGCACAATTC GCGCAGTGCT TTCCGATTTA
AATGCTGAGG CCATCGAAAC TTACATCATT TCGATGACCG AAGGCCCAAG CGATATTTTG
GCGGTGCTGT TGCTGGCTCG CGAGGCGGGC ATTTATCAGC CAGGTGAGCA TAGTTGGCTG
AATATTGTGC CATTATTTGA AACCGGAGCC GACCTCATCG CCGCGCCGGA GATAATGCAC
ACGCTGCTTT CGAGCGAAGC CTATCGCCAA CATTTGGTGT TGCGCAACGA TGTGCAAGAA
ATTATGTTGG GCTACAGCGA TTCCAACAAA GATGGTGGTT TTGCCGATGC GCACTGGGCG
CTCTATCTCG CTCAAGTGGC CTTGGCCGAA ACCTGTTTCC GACATCGAGT GGCCATGCGG
CTGTTCCATG GCCGTGGTGG GGCGGTTGGC CGTGGTGGCG GGCCTGCCAA CCGTGCGATT
TTGGGTCAAC CACCAGGCAC AGTCGGTGGG CGGATCAAAA TCACTGAACA AGGCGAAGTG
ATTAGCGATC GTTATGCCGA GCCAGAAACG GCCTATCGCC ATCAAGAGCA AATTATCAAC
GCAGTATTGC GCTCATCGTT AGGCGTGAGC ATCGCGCATA TCAGCCAAGA ATGGCACGAC
GCGATGAGTA GTTTGGCCAA GGTTTCGCGT AAAGTCTATC GCGGCTTGGT CTACGATCAT
CCGCACTTCT TGGAATACTT CCGCAATGCT ACGCCGATTA CCGAAATTAG CCGCTTGAAC
ATTGGCTCAC GCCCAGCCAG CCGCAAAGCC AGTGACCGGA TCGAAGATTT GCGAGCGATT
CCCTGGGTTT TTAGTTGGAT GCAAAGTCGG CATACCTTGC CAGGTTGGTA TGGCTTGGGC
AGTGCCTTGG AGCATTTAAT CCAAGCTGAT GCCAATGGCT TGACCACCTT GCAGGGAATG
TACAACGATT GGCCATTTTT CCGCACCATG CTGGATAATG CCCAAATGAT TTTATCCAAG
GCTGATATGG ATATTGCGGC GCAATATGCC CTGCTTGTGC CCGACCAAGC CTTAGCCAAC
GAAATCTTTG GCCTGATCAA AGCTGAATAC ACCCGCACCG TTAAATGGAT TTGCGAGGTG
GCGCAAATTA ATGAGCTGCT GGATACTAGC CCAATTTTGC AGCACTCAAT TAAGCAGCGC
AACCCGTATG TTGACCCATT AAGTTTCGTA CAAATCGAAT TGCTCCGGCG TTTGCGCACC
GATCCCGATG GACTTGAGCA TAGCGATCTT GAAGATGCAA TTTTGTTAAG TATCAACGGG
ATTGCCGCAG GCTTGAAAAA TACGGGTTAG
 
Protein sequence
MYTPETDRTP LSMLIHSLGN VLGDVIVAQD GVSAFELEED VRQRTKQRRS DGTLQETQTL 
TELISQLPVA QLMGLIKAFT HYFGLVNLAE SVERLRVLAE RDRQNGDAPR SESVELALQE
LRDRGITAQQ VQDLLDHAEI RPVFTAHPTE AKRRTTLKKH HRIAGAARQL TADTTFQRQR
ERLLESIREE VISLWQSDEV RIIKPTVIDE VKNNLYYFEE SLFDMIPQLY RDTEASLRQI
YPEHEWRVPA FLRFGSWVGG DRDGNPFVIP SVTVETLKLL MGRSLREHIH SVERLSHRLS
QSSRQVPISE ELAQSLIHDA PLFPELAQVL ERRNPHEPYR QKCSYIHAKL HATLAYVERY
EPDWARGGHR PAEGTWYANA NQYLADLATM EYSLRTNGAA SVADGFLRDI QCSAKVFGLH
TATLDIRQHS SRHTNALSEI FEYAGICDDY ASLSQAERTA VLERELANNR PLIPTHLYYS
PETVEIIETF RTIRAVLSDL NAEAIETYII SMTEGPSDIL AVLLLAREAG IYQPGEHSWL
NIVPLFETGA DLIAAPEIMH TLLSSEAYRQ HLVLRNDVQE IMLGYSDSNK DGGFADAHWA
LYLAQVALAE TCFRHRVAMR LFHGRGGAVG RGGGPANRAI LGQPPGTVGG RIKITEQGEV
ISDRYAEPET AYRHQEQIIN AVLRSSLGVS IAHISQEWHD AMSSLAKVSR KVYRGLVYDH
PHFLEYFRNA TPITEISRLN IGSRPASRKA SDRIEDLRAI PWVFSWMQSR HTLPGWYGLG
SALEHLIQAD ANGLTTLQGM YNDWPFFRTM LDNAQMILSK ADMDIAAQYA LLVPDQALAN
EIFGLIKAEY TRTVKWICEV AQINELLDTS PILQHSIKQR NPYVDPLSFV QIELLRRLRT
DPDGLEHSDL EDAILLSING IAAGLKNTG