Gene Haur_2007 details

Gene Information

Locus tag: Haur_2007
Symbol:
ID: 5733896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start (bp): 2486215
End (bp): 2487498
Gene length: 1284 bp
Protein length: 427 aa
Translation table: 11
GC content: 50%
IMG OID: 641279151
Product: citrate synthase I
Protein accession: YP_001544778
Protein GI: 159898531
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01798] citrate synthase I (hexameric type)
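
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2486215
END_BP = 2487498
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the recorded lengths.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 2486215 to 2487498 gives 1284 bp, and 1284 / 3 = 428 codons, of which 427 encode amino acids.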


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTAACA GCTTGACGAT CACCGATAAT CGTACTGGGA AAACTTACGA ATTGCCCATC 
GTCGATGGCA CCATCCGCGC CCTCGACTTA CGTCAGATCA AAGCAGATGC CGAAGATTTT
GGGCTGATGA CCCACGACCC AGGCTTCAAT AACACCTCAT CATGCCGTAG CTCAATTACC
TATATCGATG GCGACGCTGG GATTCTGCGC TATCGCGGCT ACCCAATTGA GCAATTGGCT
GATAGCAGCA GCTACCTCGA AACCGCCTAC TTGATTCTCA ATGGAGAACT GCCAACCAAA
CCACAACTTG ATGCATGGGT TCATGAAATT ACCCACCGCA CAATGGTGCA TGAAAATATC
AAGAAGTTAA TGGATGGCTT TCATTTTGAT GCCCACCCCA TGGGGATGTT GATTAGTACC
TTGAGTGCAA TGTCAACCTT CTATCCTCAA GCCAAAAACG TCAAAGATCC AGCAATGCGC
CGTTTGCAGA TTGCTCGCTT GATCGCCAAA GTGCCAACGA TCTCGGCCTA TGCCTATCGC
AAACGTATGG GCTTGCCCTA CGTTTACCCC GATAACTCGT TGAGCTATAC TGGCAACTTC
TTGAAGATGA TGTTCCAAAG AGCTGATCCA TACATTCCAG ACCCAATTAT GGAAAAAGCC
TTGGATGTTT TGTTCGTTTT GCATGCCGAC CACGAGCAAA ATTGTGGCAC GAATGCGATG
CGCTCAGTGG GTAGCTCAAA CGTCGATCCC TATTCAGCCA TGGCTGGGGC GGCGGCAGCC
TTGTATGGTC CGCTGCATGG CGGCGCAAAC GAGCAAGTGC TGCGCATGTT GCAAGAAATC
GGTTCCGCCA GTAATGTTGC CGATTACATC CGCCGCGTCA AAAATCGTGA AGTCTTGTTG
ATGGGCTTCG GCCATCGCGT CTACAAGAAC TACGATCCTC GCGCTGCGAT CGTCAAGCAG
CTGGCCTACG ATGTGTTTGA AGTTGTTGGG CGTAACCCAA TGATCGACAT TGCCTTGGAA
CTTGAAAAGA TTGCGCTCGA AGATGATTAT TTCGTCTCAC GCAAGCTGTA CCCCAATGTC
GATTTCTACA CGGGCATTAT TTACCAAGCA ATGAAATTCC CTGTTGATAT GTTCCCAGTG
CTGTTTGCCA TCCCTCGCAC GGTCGGTTGG TTGGCCCAGT GGGATGAAAT GCACAACGAC
AAGGAAACTT CAATTGCTCG CCCACGCCAG ATCTACACTG GCTACGATGC TCGCGATTTC
GTTCCAGTCG AAAAACGCGG CTAA
 
Protein sequence
MTNSLTITDN RTGKTYELPI VDGTIRALDL RQIKADAEDF GLMTHDPGFN NTSSCRSSIT 
YIDGDAGILR YRGYPIEQLA DSSSYLETAY LILNGELPTK PQLDAWVHEI THRTMVHENI
KKLMDGFHFD AHPMGMLIST LSAMSTFYPQ AKNVKDPAMR RLQIARLIAK VPTISAYAYR
KRMGLPYVYP DNSLSYTGNF LKMMFQRADP YIPDPIMEKA LDVLFVLHAD HEQNCGTNAM
RSVGSSNVDP YSAMAGAAAA LYGPLHGGAN EQVLRMLQEI GSASNVADYI RRVKNREVLL
MGFGHRVYKN YDPRAAIVKQ LAYDVFEVVG RNPMIDIALE LEKIALEDDY FVSRKLYPNV
DFYTGIIYQA MKFPVDMFPV LFAIPRTVGW LAQWDEMHND KETSIARPRQ IYTGYDARDF
VPVEKRG
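
The relationship between the gene sequence and the protein sequence can be sketched in code. This is a minimal illustration, not the database's own pipeline: it assumes that translation table 11 uses the standard codon assignments for elongation and reads an alternative start codon such as GTG as methionine at the first position. The `START_CODONS` set is a deliberately partial, assumed subset of the table 11 initiation codons, and the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}

# First row of the gene sequence, in the 10-base blocks used by the record.
FIRST_ROW = "GTGACTAACA GCTTGACGAT CACCGATAAT CGTACTGGGA AAACTTACGA ATTGCCCATC"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(round(gc_content(FIRST_ROW), 2))
```

Passing the full gene sequence to `translate` and `gc_content` instead of `FIRST_ROW` would allow a comparison against the protein sequence and the GC content field listed in this record.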