Gene Mbar_A3206 details

Gene Information

Locus tag: Mbar_A3206
Symbol: truD
ID: 3627109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 4121622
End bp: 4122938
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 47%
IMG OID: 637702045
Product: tRNA pseudouridine synthase D
Protein accession: YP_306670
Protein GI: 73670655
COG category: [S] Function unknown
COG ID: [COG0585] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00094] tRNA pseudouridine synthase, TruD family
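
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4121622
end_bp = 4122938

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Three bases per codon. Assumption: the last codon is a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1317 0 438
```

Under those assumptions the result matches the listed gene length of 1317 bp and protein length of 438 aa.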


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.688587
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00519063
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAAGTGC CGGAAATTGA AAAACAGATC GGAATAACTC TTTACTCCAC GGATACTGAC 
GGCCTCGGAG GGCAGCTTCG GCAGGAAGTG GAGGATTTTA TTGTTAAAGA AATTACAAAC
CGGGAGGAAG GACAGGAAGG GAAATACCTT ATTCTTGAAC TCATAAAGCG AGACTGGGAC
ACACATCACC TGACCCGGAC TCTTGCAAAA ATCCTCCAGG TAAGCCAGAA GCGAATCAGC
GTTGCAGGCA CAAAGGACAA GCGTGCACTT ACCACTCAGA AAATCAGTAT TTTTGACATT
GACGCCCAGA AAATTGAGAA GATCCATTTA AAAGATGTAG AGCTGAAAGT CCTTGGTCGC
TCCCGAAAAT CCGTTGAACT CGGAGACCTG TGGGGAAACA ATTTCAGAAT TACTATCCGG
AATATAACCC ACTCGAGTGA GGAGATACAC AAACTGCTCG AAAAGACCAC AAACGAGATC
CTGGCTCAGA ACGGAGTTCC GAACTTCTTC GGGATCCAGC GCTTTGGCTC GGTACGTCCT
GTCACGCATC TTGTAGGAAA AGCCATTGTT GAGGGAGATT TTGAAAAGGC TGCCCTGCTA
TATATTGCCG AACCCTTCCC TGATGAGCCA GAAGATACAC GGAAAGCCCG CCAGTTTGTT
AAAGAGACCC TCGATTTCAA AGAAGGCCTG AAAATCTATC CTCTCCACCT CGGGCATGAA
AGAGCAATGA TGAATCATCT GATCGCAAAC CCGGATGACT TTGCAGGAGC TTTTCTCGTT
CTCCCGAAAA ACCTTTACAG GATGTTTGTG CACGGCTACC AGTCCTACAT ATATAACATA
ATCCTGTGCA GGAGGATCGA AAAAGGCCTT TCCTTAAACG AGGCCGTAGA AGGTGATGTT
GTCTGTTTCA AGAATGAACA TGGCCTGCCG GATTCTTCAA AAACCGAAAA AGCCACCACA
GAAACCGTAA ATGCCATGAA CCGCCTGATT AAGAAAAAAC GGGCCTTCAT AACCGCTCCC
CTTCCGGGCC ATAATACAGA ATTTGCGTCA GGCATTCCTG GAGAGGTCGA ACAGGCAGTT
CTCGATGAAC TCAAAGTCCC TCTGCAAGGC TTCAATATAG AAGAAACTCC TGAGATGAGT
TCGAAAGGCA CCAGGCGGGA ACTCCTCCTT CAGGTTGAAC CGAAGTTTGA GGTTGCTGAA
GACGAACTGA ACCCGGGAAA GTTAAAAGCT GTGCTTGAGT TCATGCTGCC AAAGGGAAGT
TACGCAACAA CCGTGCTGCG GGAGTACATG AAGGTAGATC CTCTGCAAAT GAGTTGA
 
Protein sequence
MQVPEIEKQI GITLYSTDTD GLGGQLRQEV EDFIVKEITN REEGQEGKYL ILELIKRDWD 
THHLTRTLAK ILQVSQKRIS VAGTKDKRAL TTQKISIFDI DAQKIEKIHL KDVELKVLGR
SRKSVELGDL WGNNFRITIR NITHSSEEIH KLLEKTTNEI LAQNGVPNFF GIQRFGSVRP
VTHLVGKAIV EGDFEKAALL YIAEPFPDEP EDTRKARQFV KETLDFKEGL KIYPLHLGHE
RAMMNHLIAN PDDFAGAFLV LPKNLYRMFV HGYQSYIYNI ILCRRIEKGL SLNEAVEGDV
VCFKNEHGLP DSSKTEKATT ETVNAMNRLI KKKRAFITAP LPGHNTEFAS GIPGEVEQAV
LDELKVPLQG FNIEETPEMS SKGTRRELLL QVEPKFEVAE DELNPGKLKA VLEFMLPKGS
YATTVLREYM KVDPLQMS
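
The protein sequence can be reproduced from the gene sequence by reading it three bases at a time. The following is a minimal sketch, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the codon assignments are the standard ones, which translation table 11 shares for internal codons. Alternative start codons, which table 11 permits, are not handled; this gene begins with ATG, so that does not matter here.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in tens.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGCAAGTGC CGGAAATTGA AAAACAGATC"))  # prints: MQVPEIEKQI
# Last 15 bases, ending in the TGA stop codon.
print(translate("CTGCAAATGAGTTGA"))  # prints: LQMS
```

Both fragments agree with the corresponding ends of the listed protein sequence.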