Gene Haur_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4016 
Symbol 
ID5735877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5122957 
End bp5124075 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content55% 
IMG OID641281166 
Productcystathionine gamma-synthase 
Protein accessionYP_001546776 
Protein GI159900529 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.683871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCTG AAACCCTGTT AATTCATGCT GGTCGTAGCG TCGATGCTGC GACCAGTGCA 
GTTACGCCGC CGATTCATCT TGCCAGTACC TTTGAACGAG CGGCTGATGG TAGTTTGCCC
CATGGGTTTG TCTATACACG TTTTGGTAAC CCCACCCGCC AAGCCTTGGA ACAAGCCTTA
GCGGCACTTG AAGGTGGAGT TGAGGCGGCG GCATTTGGCT CTGGTTCGGC GGCTACAACC
GCAGTGCTGC AAAGTTTAGC GCCAGGCCAG CGCGTGTTGT TGCCACGCGA TTGCTACAAC
GGCACGGCCA ATTTGGTGCG TCAGGTATTT GCTCAACTTG ATGCTCAATT TGTCGATATG
ACCGACTTGG CGGCAGTGCA AGCAGCGTTG GAGCCAGCCC CAGCCTTGGT TTGGCTCGAA
ACGCCATCCA ACCCAACCTT GCGATTAACA GATTTGGCGG CGGTGAGCAA TTTGGCGCAT
GCGGTTGGGG CATTGGTGGT TTGCGATAAT ACCTGGGCCA CGCCGCTTGG CCAGCGCCCA
TTTGATTTGG GCGTGGATTT GGTGATGCAT TCGACCACCA AATATCTTGG CGGCCATAGC
GATGTGCTGG GTGGTGCATT AATTACCAAA ACTGTAACAC CTTGGTGGCA ACGGTTGCAG
CAAATTCATG TGCTGGCCGG AGCCGTGCCC TCGCCGTTTG AATGTTGGCT GATTTTGCGC
GGCATGCAAA GTTTGGCCTA TCGGCTGCGC GGCCATTGTG CCAATGCTTT GGCGGTGGCT
GAGTGGTTGG CACAACATCC CAAGGTGCAG GCTGTGCATT ATCCTGGCTT GACAAGCCAT
CCTCAATTTG AATTGGCTCA ACGCCAAATG CTGCTGATGG GCGGTATGGT TTCGTTCGAG
GTAGTTGGTG GTGCGGCTGA AGCGATTGCA GTAGCGGCCC ACGTTAAGTT ATGGACGCGG
GCAACCAGCC TTGGTGGCCC TGAAAGTTTG ATCGAGCATC GGGCCACACT CGAAGGCCCA
GATTCGCCAA CCCCGCCAGC CTTGTTGCGG CTTTCGGTCG GCCTCGAACA CCCCGATGAT
TTGATTGCCG ATTTGGCCCA AGCCTTGGCA GTATTGTAG
 
Protein sequence
MKPETLLIHA GRSVDAATSA VTPPIHLAST FERAADGSLP HGFVYTRFGN PTRQALEQAL 
AALEGGVEAA AFGSGSAATT AVLQSLAPGQ RVLLPRDCYN GTANLVRQVF AQLDAQFVDM
TDLAAVQAAL EPAPALVWLE TPSNPTLRLT DLAAVSNLAH AVGALVVCDN TWATPLGQRP
FDLGVDLVMH STTKYLGGHS DVLGGALITK TVTPWWQRLQ QIHVLAGAVP SPFECWLILR
GMQSLAYRLR GHCANALAVA EWLAQHPKVQ AVHYPGLTSH PQFELAQRQM LLMGGMVSFE
VVGGAAEAIA VAAHVKLWTR ATSLGGPESL IEHRATLEGP DSPTPPALLR LSVGLEHPDD
LIADLAQALA VL