Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2031 |
Symbol | |
ID | 5733920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2526773 |
End bp | 2528194 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279175 |
Product | citrate (Si)-synthase |
Protein accession | YP_001544802 |
Protein GI | 159898555 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.602388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAATC CACAAAAAAT CACCAATAAT CAAATTTCAG ATATAACCAA TCTGATTGCT CAGGTGCTGC AAATTCCAGA ATATATGGTA ACCGATCAAC TTAGCTACCG TTCAATTATG CAATGGGATT CATTGGCGCA TATTAACTTG ATGTTGGCAT TAGAAACAAC CTATGGAGTG ACAATCGAAC CTGATACCAT TGTTACACTT AGTTCTGTTG CCCAAATTAA GGCCTATATG CAGGGGACGC TTCAGCCGAC ATCAACCCAC ACAGCAGGGA TTGAAACTCA AACTGACGGT GGTGTGGTCT ATCGTGGCCT TGCAGGAGTA ACCTTCGATC AGAGCCAGAT CACGTTGATC GATGGCAAAC AAGGCCGCTT GCAATATCGC GGCTATAGCA TTCATGATTT GGTTGAGCAA ACGACGTTTG AGGAAACAGC CTTTTTATTA TTGAATGGGC AATTGCCAAC AACCACTGAG TTGGATCAAT TTAAGCACCA GTTGGTGGCA GCACGAGAGA TACCAGCAAC GATTTTGGAG TTGATTCAGC TCCTAAAAGA TGGCCATCCC ACCGAAGTTT TGCGCACCTG CCTTTCGGCG CTTGCCACGT TCGATGCTGA GCGCCTTGAT GGATCAGTTA CTGCAACCAA AGCTCGGTCT ATTCGACTGA TTGCACAAAT GCCCTGCTTG ATTGCAGCTC ATCATGCGAT TCGCCAAGGA AAAACACCAA TCCAGCCCAA CCCAACGCTT GATCATGCTG CTAATTTTCT CTACATGCTC CAAGGAACCA TTCCGAGCCA ACAGGCTCAG CAGATCGTGA ATCAGATTCT GATTCTGCAT GCTGATCATA GTGCCAATGC TTCAACCTTT GCAGCCCGAG TTGTCGCAGG AACACGCTCA GACTGGTATG CAGCGCTCAC TGCTGCAATT GCCGCCTTTG CAGGCCCACT CCATGGCGGG GCCATCGAAC AGGTCATTGC CATGATTCAA GCGATCGGTA CTCCTGAGCG AGCTGCCGAT TATGTAGCCA ACTTACAAGC CAACAATCAG CCAGTGATGG GTTTTGGGCA TCGGGTCTAT CAAACCGAAG ATCCACGAGC ACGTCATTTA CGCAAAGCAG CGCAGGCCTT GAGCGCACAA AGCGATAACA ACTATTACGC GATTCTTGAA GCGGTTGTTC AAGCAATGCG TCCCTATATG GCCAAAGGAA TCGATGTGAA TGTTGATTTT TATGCCAGTG TTATTTATCA CCTTTTGGGT ATTCCCTACG ATCTGTTTGT TCCAGCATTT ATTGTTGGCC GAACCGTGGG CTGGTTGGCC CAAATTCAAG AGCAATATGC CAACAATATT TTGATTCGCC CATTGCTTGC CTATGTTGGG CCAATTGATC AGCCCTATCC AGCATTGAGC CAACGTCAAT AA
|
Protein sequence | MVNPQKITNN QISDITNLIA QVLQIPEYMV TDQLSYRSIM QWDSLAHINL MLALETTYGV TIEPDTIVTL SSVAQIKAYM QGTLQPTSTH TAGIETQTDG GVVYRGLAGV TFDQSQITLI DGKQGRLQYR GYSIHDLVEQ TTFEETAFLL LNGQLPTTTE LDQFKHQLVA AREIPATILE LIQLLKDGHP TEVLRTCLSA LATFDAERLD GSVTATKARS IRLIAQMPCL IAAHHAIRQG KTPIQPNPTL DHAANFLYML QGTIPSQQAQ QIVNQILILH ADHSANASTF AARVVAGTRS DWYAALTAAI AAFAGPLHGG AIEQVIAMIQ AIGTPERAAD YVANLQANNQ PVMGFGHRVY QTEDPRARHL RKAAQALSAQ SDNNYYAILE AVVQAMRPYM AKGIDVNVDF YASVIYHLLG IPYDLFVPAF IVGRTVGWLA QIQEQYANNI LIRPLLAYVG PIDQPYPALS QRQ
|
| |