Gene Haur_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2031 
Symbol 
ID5733920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2526773 
End bp2528194 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content47% 
IMG OID641279175 
Productcitrate (Si)-synthase 
Protein accessionYP_001544802 
Protein GI159898555 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.602388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAATC CACAAAAAAT CACCAATAAT CAAATTTCAG ATATAACCAA TCTGATTGCT 
CAGGTGCTGC AAATTCCAGA ATATATGGTA ACCGATCAAC TTAGCTACCG TTCAATTATG
CAATGGGATT CATTGGCGCA TATTAACTTG ATGTTGGCAT TAGAAACAAC CTATGGAGTG
ACAATCGAAC CTGATACCAT TGTTACACTT AGTTCTGTTG CCCAAATTAA GGCCTATATG
CAGGGGACGC TTCAGCCGAC ATCAACCCAC ACAGCAGGGA TTGAAACTCA AACTGACGGT
GGTGTGGTCT ATCGTGGCCT TGCAGGAGTA ACCTTCGATC AGAGCCAGAT CACGTTGATC
GATGGCAAAC AAGGCCGCTT GCAATATCGC GGCTATAGCA TTCATGATTT GGTTGAGCAA
ACGACGTTTG AGGAAACAGC CTTTTTATTA TTGAATGGGC AATTGCCAAC AACCACTGAG
TTGGATCAAT TTAAGCACCA GTTGGTGGCA GCACGAGAGA TACCAGCAAC GATTTTGGAG
TTGATTCAGC TCCTAAAAGA TGGCCATCCC ACCGAAGTTT TGCGCACCTG CCTTTCGGCG
CTTGCCACGT TCGATGCTGA GCGCCTTGAT GGATCAGTTA CTGCAACCAA AGCTCGGTCT
ATTCGACTGA TTGCACAAAT GCCCTGCTTG ATTGCAGCTC ATCATGCGAT TCGCCAAGGA
AAAACACCAA TCCAGCCCAA CCCAACGCTT GATCATGCTG CTAATTTTCT CTACATGCTC
CAAGGAACCA TTCCGAGCCA ACAGGCTCAG CAGATCGTGA ATCAGATTCT GATTCTGCAT
GCTGATCATA GTGCCAATGC TTCAACCTTT GCAGCCCGAG TTGTCGCAGG AACACGCTCA
GACTGGTATG CAGCGCTCAC TGCTGCAATT GCCGCCTTTG CAGGCCCACT CCATGGCGGG
GCCATCGAAC AGGTCATTGC CATGATTCAA GCGATCGGTA CTCCTGAGCG AGCTGCCGAT
TATGTAGCCA ACTTACAAGC CAACAATCAG CCAGTGATGG GTTTTGGGCA TCGGGTCTAT
CAAACCGAAG ATCCACGAGC ACGTCATTTA CGCAAAGCAG CGCAGGCCTT GAGCGCACAA
AGCGATAACA ACTATTACGC GATTCTTGAA GCGGTTGTTC AAGCAATGCG TCCCTATATG
GCCAAAGGAA TCGATGTGAA TGTTGATTTT TATGCCAGTG TTATTTATCA CCTTTTGGGT
ATTCCCTACG ATCTGTTTGT TCCAGCATTT ATTGTTGGCC GAACCGTGGG CTGGTTGGCC
CAAATTCAAG AGCAATATGC CAACAATATT TTGATTCGCC CATTGCTTGC CTATGTTGGG
CCAATTGATC AGCCCTATCC AGCATTGAGC CAACGTCAAT AA
 
Protein sequence
MVNPQKITNN QISDITNLIA QVLQIPEYMV TDQLSYRSIM QWDSLAHINL MLALETTYGV 
TIEPDTIVTL SSVAQIKAYM QGTLQPTSTH TAGIETQTDG GVVYRGLAGV TFDQSQITLI
DGKQGRLQYR GYSIHDLVEQ TTFEETAFLL LNGQLPTTTE LDQFKHQLVA AREIPATILE
LIQLLKDGHP TEVLRTCLSA LATFDAERLD GSVTATKARS IRLIAQMPCL IAAHHAIRQG
KTPIQPNPTL DHAANFLYML QGTIPSQQAQ QIVNQILILH ADHSANASTF AARVVAGTRS
DWYAALTAAI AAFAGPLHGG AIEQVIAMIQ AIGTPERAAD YVANLQANNQ PVMGFGHRVY
QTEDPRARHL RKAAQALSAQ SDNNYYAILE AVVQAMRPYM AKGIDVNVDF YASVIYHLLG
IPYDLFVPAF IVGRTVGWLA QIQEQYANNI LIRPLLAYVG PIDQPYPALS QRQ