Gene Htur_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1366 
Symbol 
ID8741956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1417611 
End bp1419224 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content70% 
IMG OID646511943 
Productpara-aminobenzoate synthase component I 
Protein accessionYP_003402927 
Protein GI284164648 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR01824] aminodeoxychorismate synthase, component I, clade 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATC CGCGCGTCGT TACCTCGCTC GCGTCGTTTC GAGCCGCCGC CCATGAGCTG 
CTCGAGGGTG ACGACGCCAC ACCGACCGAT AATGCGCCGG CTGACAACGT GACGACCGAC
GTCGCGACGT CACGAGAACC CGACGTTCGA ATTCCAATCG AAGTCCGCGT CGCCGTCGAC
GATCCGTTTC TCGCCTATCG ACGGGCGCGC GATGCCGACG CGGGCGGCGC CTTCCTCGAG
ACGACCGGCG GCCAGCCCGG CTGGGGCTAC TTCGGCGTCG ACCCCGTCGA CCGGCTGACG
GTCGGGCCCG ACGCGGTCGC GCGAACTGAC GACGAGGATT CTCCGACGCT GGCGGCCCTC
GAGGGGCTCC TCGAGCAGGA CCAGCTGGTT CGCGGCGACT GTTCGGTCCC CTACCCCTGC
GGGGCGATCG GCTGGCTCTC CTACGACGTC GCCCGCGAAC TCGAGTCCCT TCCCGAGTCG
GCCGTCGACG ATCGGGGGCT TCCCCGCCTC GAGATCGGCG TCTACGACCG GCTGGCGGCC
TGGGAAGCGC CGACCGACGA CGGTGAGGTG ACGCTGCGGG TGACGGCCTG TCCGCGAATC
GCGGTCGGCG ACGGCCGCTC CGACGAGACG CTCGAGGCGG CCTACGAACG CGGCCGCGAC
CGGGCGCTCG AGCTCGCGCG GGCCGCCCTC GAGGGCGATC CCGCGGTCGA CGAGCCGCCA
GTCGCGACGT CCGAAGCGAC GTTCGAGAGC GACTGCGGCC GCGAGGCGTT CGCCGAGCGC
GTCCGTCGAG TCAAGGAGTA CGTCCGTGAC GGCGACACCT TTCAGGCGAA CGTCTCCCAG
CGGCTGGTCG CCCCCGCGGC GGTCCACCCC GTCGCGGCCT ACGACGCCCT CCGACGGGTC
AACCCCGCGC CGTACTCGGG GCTCCTCGAG TTTCGTGCGG CCGATCTGGT GAGCGCGAGT
CCCGAGCTAT TACTGGAACG AAATGGCGAC TTCGTCCGGA CGGAACCCAT CGCGGGCACG
CGACCGCGCG GCGAGACGGC CGAAGACGAC CGAGAACTCG AGGAGGACCT CCTGACCGAC
GAGAAGGAAC GCGCCGAACA CGCAATGTTG GTCGATCTGG AACGTAACGA CCTCGGGAAG
GTCTGCGAGT ACGGCTCCGT GACGGTCGAC GAGTACCGGC GGATCGACCG CTACTCGGAG
GTGATGCACC TCGTCTCGAA CGTGACCGGA CGACTGCGCG ACGACGAGTC GCTGGCCGAC
GCTATCGCGG CGGTCTTCCC GGGCGGTACG ATCACCGGCG CGCCGAAGCC GCGGACGATG
GAAATCATCG ACGAACTTGA GGCGACCCGT CGGGGCCCCT ACACGGGCAG CGTCGGAATC
TTCGGTTTCG ACGGGCGGGC GACGCTGAAC ATCGTCATCC GGACGCTCGT CCGCCACGCC
GAGGAGTACC ACCTCCGCGT CGGCGCCGGG ATCGTCCACG ACTCCGATCC CTACCGCGAG
TACGACGAGA CCCTCGACAA GGCCCGCGCG CTGATCGCGG CCGTCGACGA GGCACTGGGC
GAGCGGGCCG GAATGGCGCT CGAGGCTGAA GGCAGAGGTG AGCAGCGTGA GTGA
 
Protein sequence
MSDPRVVTSL ASFRAAAHEL LEGDDATPTD NAPADNVTTD VATSREPDVR IPIEVRVAVD 
DPFLAYRRAR DADAGGAFLE TTGGQPGWGY FGVDPVDRLT VGPDAVARTD DEDSPTLAAL
EGLLEQDQLV RGDCSVPYPC GAIGWLSYDV ARELESLPES AVDDRGLPRL EIGVYDRLAA
WEAPTDDGEV TLRVTACPRI AVGDGRSDET LEAAYERGRD RALELARAAL EGDPAVDEPP
VATSEATFES DCGREAFAER VRRVKEYVRD GDTFQANVSQ RLVAPAAVHP VAAYDALRRV
NPAPYSGLLE FRAADLVSAS PELLLERNGD FVRTEPIAGT RPRGETAEDD RELEEDLLTD
EKERAEHAML VDLERNDLGK VCEYGSVTVD EYRRIDRYSE VMHLVSNVTG RLRDDESLAD
AIAAVFPGGT ITGAPKPRTM EIIDELEATR RGPYTGSVGI FGFDGRATLN IVIRTLVRHA
EEYHLRVGAG IVHDSDPYRE YDETLDKARA LIAAVDEALG ERAGMALEAE GRGEQRE