Gene Dret_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1107 
Symbol 
ID8418932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1299650 
End bp1300594 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content58% 
IMG OID645037679 
Productaminotransferase class IV 
Protein accessionYP_003197973 
Protein GI258405231 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0639089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0333795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCGCA CCATCTCAGA CCAGAGCTAT CTTGAGGCCC TTTTGGCAGC CCCCCGGCCG 
GGAATCGATC AGGTACGGGC TTTCTATGAC CACCGCGTCG GTGTGATCGG GAAGGATCCC
CGGTATCTGC TTATTCCCAT GGATGACCAC TTGGTCCATC GGGGCGACGG GGTCTTCGAA
ACCCTGAAAT TTACTGCGAA GCGGTTGTAC CAACTTGATG CCCATGTCGA GCGCCTCTTC
CATTCCGCTA AGACCATCGC CATCCATCCT CCGTGTTCGC GAGAGGACGT TCGGGAGTTG
ATCATAGACC TTGCCGCGGC TTCAGAACTC GAAAACGGTA TCGTGGCTGT GTACGTGGGG
CGCGGCCCCG GCGGGTTTTC CGCAGATTTC CGGGAATGCC CCCAGCCCAG CCTGTACGGC
GTGGCCCGGA TCATGCCAGA GCGCCCGGAA GAGCTTTGGG AAAAGGGAGT GACCGCGTAC
ACGACCAGTT TCCCGGCCAA ACAATGCTAT CTGTCGCGGA TCAAGACCGT TGATTATCTC
CCCAATGTGC TCATGAAGCG TGAGGCCGTG CTCAAGGGAT ACGATTACCC CCTGTGTTTC
GACGAACAAG GGTTTTTGGC CGAAGGAGCC ACGGAAAATG TCTGCCTGGT CAATGCCTCG
GGCGAATTGA TCGTCCCGGA ATTGCGCAAT GCTTTACCTG GTACGACTCT ATTGCGCGGC
CTGGATCTCA TCCGACCGGA ACTGCCGGTT GAACACCGTT TAGTGAAAGA GGATGAACTC
TATCAGGCCA AGGAACTCAT TTTGCTGGGC ACCTCGTTGG ATGCCATCAG TGTGGTCCGT
TTTAACGGCC GGCCGATCCA CGATGTCCGG CCCGGACCGG TCAGCCGCCG TTTGCGGCAG
TTGTTGCGGG AAGACCAAGA GCGCAACGGG ACACCGATTC ATTGA
 
Protein sequence
MPRTISDQSY LEALLAAPRP GIDQVRAFYD HRVGVIGKDP RYLLIPMDDH LVHRGDGVFE 
TLKFTAKRLY QLDAHVERLF HSAKTIAIHP PCSREDVREL IIDLAAASEL ENGIVAVYVG
RGPGGFSADF RECPQPSLYG VARIMPERPE ELWEKGVTAY TTSFPAKQCY LSRIKTVDYL
PNVLMKREAV LKGYDYPLCF DEQGFLAEGA TENVCLVNAS GELIVPELRN ALPGTTLLRG
LDLIRPELPV EHRLVKEDEL YQAKELILLG TSLDAISVVR FNGRPIHDVR PGPVSRRLRQ
LLREDQERNG TPIH