Gene Dret_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2197 
Symbol 
ID8420053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2499979 
End bp2501079 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content58% 
IMG OID645038796 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_003199059 
Protein GI258406317 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGT TTTCCCGTGT GCGACCGGAG ATTGCCGGTT TGAAACCCTA TACCCCTGGG 
CTGTCTATCG AAGAAATCAA GGATCGGTAT GCATTGACCA CTGTGTGCAA GATGGCCAGC
AACGAAAATC CACTAGGCAC CTCGCCGCTG GTCCAGGAGG CGTTGTGCCG GTTTGCGCCG
TATGCCTTTC GCTATCCGCG CGGCGGATGT CCGGATTTGA GTGCGGCTTT GGCCAAAGTG
CTGGGGGTCC CCGGGGAGTG CGTCGTGGTC GGCAACGGTT CGGATGAATT GATTGATCTC
TTGATCCGAA CCACGCTGCG CCCCGAAAAG GACAATATGG TCGTTTTTGA TCCCAGCTTT
AGTATTTATC GGATGCAGGC CACCTTGTGC GGTGTGGAAT GCCGACAGGT TCCGCTTGAG
CAGGACCTGA CATTTGACTT TGATCGGTTG CTGGAGCAGG TGGATTCCCG AACCGGGCTG
GTTTTCGTGA CCAACCCGGA CAATCCGTCC GGCCACGCCG TACCTGCGGC GCAGCTTATG
GAACTCGCCC GGTCCCTGCC TCAGCAGTGT CTGCTGGTAG TGGACGAGGC GTATATTGAA
TTCGCCGAAA ACGGCATCTC CCCCCTGGCC GATTGGGATG CACACGGAAA TATCGTCTTG
CTGCGCACCT TTTCGAAACT CTATGGGCTG GCCGGTTTGC GCCTGGGCTA CGGGATCATG
CCGGATTGGT TGGCCGAGGC CGTACTGCGG ATCAAATTGC CCTTTAGCGT CAACCTGTTG
GCTGAAAAGG CAGGCGTCGC CGCGCTGGAA GACACCGCCT TCTATACCCG GACCAGGGAG
GTCGTTGGCG AGGGACGGCG TATCTTGAGC GCGGGGTTGC GGGAGTTGGG CTGCGAGGTC
AGTCCGTCAC AGGCCAATTT CCTCCTCTTC CGTCCCCCGA TGCCCGCGCG CGAGGTTTTC
GAGCGTCTGC TGGCCAAAGG GATCATCATC CGCCCCCTGA CCAGCTACGG CCTGGAAGAT
GCTCTGCGTG TCAGTGTCGG TACCGCTCAT GAGAATTCCA GATTCCTTGA AGCCATGCAG
GAGATAGTCC ATGCCCGCTA A
 
Protein sequence
MAKFSRVRPE IAGLKPYTPG LSIEEIKDRY ALTTVCKMAS NENPLGTSPL VQEALCRFAP 
YAFRYPRGGC PDLSAALAKV LGVPGECVVV GNGSDELIDL LIRTTLRPEK DNMVVFDPSF
SIYRMQATLC GVECRQVPLE QDLTFDFDRL LEQVDSRTGL VFVTNPDNPS GHAVPAAQLM
ELARSLPQQC LLVVDEAYIE FAENGISPLA DWDAHGNIVL LRTFSKLYGL AGLRLGYGIM
PDWLAEAVLR IKLPFSVNLL AEKAGVAALE DTAFYTRTRE VVGEGRRILS AGLRELGCEV
SPSQANFLLF RPPMPAREVF ERLLAKGIII RPLTSYGLED ALRVSVGTAH ENSRFLEAMQ
EIVHAR