Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4145 |
Symbol | |
ID | 5736006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5292933 |
End bp | 5294120 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281299 |
Product | citrate synthase |
Protein accession | YP_001546905 |
Protein GI | 159900658 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.017146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGCT ATTTGAGTGC TCGCGAAGTG GCCGAGCATT TGGGAATTAG TGTGCCAACC GTTTATGCCT ATGTAAGCCG CGGGTTGTTG CACTCGGAGC CAGCCGGAGC GCATGCGCAT CGCTATTTAT TTGATGAGGT TGAGCAATTG AAATTACGCC GCGAAGGTCG GCGCAATCCG GCCAGTGTGG CGGCTGGCGC ATTGCACTGG GGCACGCCAG TGCTCGATTC AGCGATCACG CTGATCGACA ATCAACAGGT CTATTATCGT GGGTTCGATG TGCGCGAATT GGCCAGTAGT GCCAGCATCG AAGAGGTTGC GAGCCTGATT TGGGTCGGCG ATCGGCAACA AAGCCAGCAA TTATGGGGAG ATTTGCCAGC GCTAAACATT CCCCAGTTGC CTGATCTACC GTTGCTTGAG CAATTTGTCG TGGCGTTGGC GATGGTTGGC AGCAACGATC ATGCCGCTTT CGATCAACGC CCGCAAGGCT TGCAACGCAC AGCGGTACGC GTGGTGCAGT TGTTGATCAG CGTAGCCGCC AAACGTCCCT TGCAAGGCCC AATCGTGGCG CAATTGGCCG AGGCATGGCA GATTCCGCCA AATTTACAGC CGTTGCTCAA CGCTGCATTA ATTGTCAGCG CCGATCATGA ACTCAATGTT TCATCGTTTA CTGCTCGCTG CGTAGCTTCG GCAGGCACAA CCTTATATGC TGTAGTTAGT GCGGGTTTAG CGGCGTTGGG TGGAGTGCAT CATGGCAAGC AAAGCGAATT AAGCGAACTG CTGTTGGATG AACTTTTGCG TAGCCCCGAT ATCTATGCGG CGTTGGCTCA AAAACTGCGC TTAGGCCAGC CAATTCCAGG TTTTGGCCAC CCGATGTATC CCAATGGCGA CCCACGCGGC AAGGTGTTGC TCGATCTGAT TCGGCAATTA GCGCCGCAAT GTAGCAACGA TATTGATCGC ATTATCCAAG CCGTTTATGA ATTGTTGGGC GATCATCCGA CGATCGATTT TGGCTTGGCG TGGGTTGGGC GGGTGCTAAA TTTGCCCTTG GGCAGCGCAA TGAGCCTATT TGCCCTAGGC CGCAGCGTTG GCTGGATTGG TCATGCCTTA GAGCAATATA GCGATGCACG GTTGATTCGG CCACGCGCTC GCTACACCGG CGAACGGCCA AATATAATCA AGATTTAA
|
Protein sequence | MTRYLSAREV AEHLGISVPT VYAYVSRGLL HSEPAGAHAH RYLFDEVEQL KLRREGRRNP ASVAAGALHW GTPVLDSAIT LIDNQQVYYR GFDVRELASS ASIEEVASLI WVGDRQQSQQ LWGDLPALNI PQLPDLPLLE QFVVALAMVG SNDHAAFDQR PQGLQRTAVR VVQLLISVAA KRPLQGPIVA QLAEAWQIPP NLQPLLNAAL IVSADHELNV SSFTARCVAS AGTTLYAVVS AGLAALGGVH HGKQSELSEL LLDELLRSPD IYAALAQKLR LGQPIPGFGH PMYPNGDPRG KVLLDLIRQL APQCSNDIDR IIQAVYELLG DHPTIDFGLA WVGRVLNLPL GSAMSLFALG RSVGWIGHAL EQYSDARLIR PRARYTGERP NIIKI
|
| |