Gene Haur_4146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4146 
Symbol 
ID5736007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5294260 
End bp5295387 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content55% 
IMG OID641281300 
Productcitrate (Si)-synthase 
Protein accessionYP_001546906 
Protein GI159900659 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACTG AAACCCAAGT TCATGTAGGT TTAGAAGGGA TCGTTGCAGC GGCAACGCGG 
CTCAGCAGTG TCGATGGCCA AGCTGGGGAA TTAATTATTG CAGGCTTTCC CTTGGAGCGT
TTGGCCCCGT TTGCGACCTT CGAGGAAACG ATTTTTCTCT TGTGGAATGA CCATTTGCCA
AGCCAAAGCG AGTTGGCCGA ATTGCGCCAG AGCCTTGCCA GCCAGCGCCA ATTGCCAGCC
CTTACCTTAG AAGTCGCTCA ACAGCTGGGC CGCGAACAGG CCGACCCGAT GGATGCATTG
CGGGCTGCGA CCGCTACATT AAACCCATTG GCTGACGAAA AAGCTACGGC CCAACGAATT
GTGGCAGCCT TGCCCACGAT TGTGGCGGCC TATTGGCGAG CACGCAACCA AGCCGAATTT
ATCGAGCCAC GCAGCGATTT AAGCCATGCT GCCAATTATT TGTGGATGTT GACTGGCAAA
GAGCCAAGCG CTGAGAAGGT GCGGGCACTC GAAACCTATC TCAACACCGT AGTTGACCAT
GGCCTGAATG CCTCGACCTT CACTACTCGC GTGATCATCT CGACTGAATC GGATTTGGTT
TCGGCGATTA CTGGGGCGAT TGGAGCGCTC AAAGGGCCGT TGCATGGCGG CGCACCTGGC
CCAGCCTTGG ATATGGTATT TGAAATTGGC ACAGCCGATC GCGCTGAGGA AGTACTTCGC
GCCAAGTTAG CACGCGGCGA GCGCTTGATG GGCTTTGGCC ATCGCGTCTA CAAGGTGCGC
GATCCACGGG CCGAGGTTTT GGCAGGCGCA GCCGATCAAC TTTTTGCCAA CGATGGCAAC
CGCGAGTTAT ACGAACTTGT ACGCCATGTT GAGCAAACCG CGATTCGGCT GCTCGAAGAA
CACAAGCCAG GCCGCAAATT GCAAACCAAT GTCGAGTTCT ACACTGCCTT GCTGTTGCAT
GGCATCGATT TTGAAACCGA CCTGTTTACC CCAACCTTTA CGATCAGCCG CGCTGTTGGT
TGGATTGCCC ACGCCTTCGA GCAACGCGCC GTTGGCCGAA TTATTCGCCC ACAATCGATT
TATACTGGCG AACGCAACCG CACGTGGGTT GAGGTTGCCG AGCGGTAA
 
Protein sequence
MATETQVHVG LEGIVAAATR LSSVDGQAGE LIIAGFPLER LAPFATFEET IFLLWNDHLP 
SQSELAELRQ SLASQRQLPA LTLEVAQQLG REQADPMDAL RAATATLNPL ADEKATAQRI
VAALPTIVAA YWRARNQAEF IEPRSDLSHA ANYLWMLTGK EPSAEKVRAL ETYLNTVVDH
GLNASTFTTR VIISTESDLV SAITGAIGAL KGPLHGGAPG PALDMVFEIG TADRAEEVLR
AKLARGERLM GFGHRVYKVR DPRAEVLAGA ADQLFANDGN RELYELVRHV EQTAIRLLEE
HKPGRKLQTN VEFYTALLLH GIDFETDLFT PTFTISRAVG WIAHAFEQRA VGRIIRPQSI
YTGERNRTWV EVAER