Gene Daud_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1626 
Symbol 
ID6025454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1713694 
End bp1714761 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID641594449 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001717760 
Protein GI169831778 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTCCA TAAATGAACT GGTGCGTTCG GAACTATTGG ACTTGCAGCC CTACCACGTC 
CCGGTATACC CTGGGATGAT CAAGCTGGAC GCCAACGAGA ACAACTATGA CTTTCCCGAG
CAAGTCCTGG AAGAGGTCTT AAGCACCATA GGTGGACAAA CCTTCGGGCG CTATCCGGAC
CCGTCGGCGC TGCAACTGCG GGAGTCCCTG TCCCGGTACA CCGGCGTGGA CCGAAACCGG
ATCACGGTCG GCAACGGATC GGACGAGCTG ATCCTGGACC TGATGCTGGC CTTCGCCGCC
GGGGGCAAGG TGGTGATCTG CACGCCGACA TTCACGATGT ATGAGATTCA CGCGGTCATC
GCGGGGGCCG AGCCGGTGGC CCTGCCCCGC AAGGCCGATT TCAGCGTGGA CCCGGACGCC
GTGATCGAGG CCGCCACCCG GCCCGGAGTG AAAATGGTGG TGCTGTGTTC TCCGAACAAC
CCGACCGGGA ACACCACCCC TTTGGAGGTG ACGGAGAAAA TCCTTAAGCA CACCAGGGCG
GTGGTGGTGC TAGACGAAGC CTACTACGAG TTCTGCGGCG AAACGGCCGT GTCCCTACTG
GACAAATACC CGCAGCTGGT GCTCTTACGC ACCTTTTCCA AGGCGTTCGG CTTGGCCGGG
CTCCGGCTGG GCTATATGCT GGCCGGACCG GAGGTGACTG GGATCATCCA ACGCGTCCAG
CAGCCCTTTA ACGTGAACGC CTTCACCCAA CAGGCCGCCA TCCGGGTGCT GGAATGGCAA
CCGCTGTTCG AGCGCCGTGT CCGGCAGATC TGCCGGTCGC GGGACGAGCT GTTCGCGGCG
ATGCGTTACC TGCCGGGGCT TACCGTCTAC CCGTCCCGGG CGAATTTCCT GCTGTTCCGG
ACCGAGATGC GCGCCCAGCA GGTGTTCAGC GGGCTTTTGC AGCGGGGTGT GCTGGTGCGC
CTGCTGGACC GTCCGGACCT GCCGAGCTGC CTGCGGGTGT CGGTCGGCCG GCCCGAGGAG
AACCTGATTT TTTGCAACCG GTTGGCTGAC GTGCTGCGGA CCGGGTAG
 
Protein sequence
MRSINELVRS ELLDLQPYHV PVYPGMIKLD ANENNYDFPE QVLEEVLSTI GGQTFGRYPD 
PSALQLRESL SRYTGVDRNR ITVGNGSDEL ILDLMLAFAA GGKVVICTPT FTMYEIHAVI
AGAEPVALPR KADFSVDPDA VIEAATRPGV KMVVLCSPNN PTGNTTPLEV TEKILKHTRA
VVVLDEAYYE FCGETAVSLL DKYPQLVLLR TFSKAFGLAG LRLGYMLAGP EVTGIIQRVQ
QPFNVNAFTQ QAAIRVLEWQ PLFERRVRQI CRSRDELFAA MRYLPGLTVY PSRANFLLFR
TEMRAQQVFS GLLQRGVLVR LLDRPDLPSC LRVSVGRPEE NLIFCNRLAD VLRTG